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^ (57) Abstract: Method of expressing recombinant protein in CHO cells, by using an expression vector compnsmg the munne IgG 
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Method of expressing recombinant protein in CHO cells 

The present invention relates to a method for expressing a recombinant product gene in a 
CHO cell line as well as to recombinant CHO host cells and to novel expression vector 
constructs. 

The Chinese Hamster ovary cell (CHO) mammalian expression system is widely used in 
production of recombinant protein. Apart from lymphoid cell lines such as hybridoma cell 
lines, it is one of the few cell types allowing for simple and efficient high-density 
suspension batch culture of animal cell. Furthermore, they allow for very high product 
yields and are comparatively robust to metabolic stresses whereas lymphoid cells are more 
difficult to culture at an industrial scale. Given considerable cost of production, it is of 
utmost importance to maximize the yield of recombinant protein per bioreactor run. Choice 
of culture medium composition and bioreactor design and operation are parameters that 
impact yield and may be quite complex to optimize. More predictably, increases in the 
strength or transcriptional activity of the promoter controlling expression of product 
protein enhance yield, hicremental mcreases at the single cell level will translate into 
considerable improvements of product yield in high-density batch or fed-batch culture 
showing stationary phase gene expression at cell densities in the range of 10^ to 10^ 
cells/ml. 

US5,866,359 describes a method of enhancing expression from an ahready strong hCMV 
promoter in CHO and NSO cells by co-expressiong adenoviral El A protein from a weak 
promoter. El A is a multifunctional transcription factor which may act on cell cycle 
regulation and has both independent transcriptional activating and repressing functional 
domains. The finetuning of El A expression to appropriate low level expression is crucial 
for success of the co-expression approach in order to achieve the ideal balance in between 
gene transactivation whilst avoiding any negative impact on cell cycle progression. 
As a disadvantage, apart from careful choice of the promoter driving El A expression, this 
system blocks part of the protein synthesis capacity of the cell with El A expression rather 
than expressing the recombinant protein of interest 

WO 95/175 16 describes use of the murine inununoglobulin gamma 2A locus for targetting 
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an expression vector construct to a highly active gene locus in lymphoid cells of the B-cell 
lineage, e.g. widely used NSO myeloma cells. NSO cells essentially are a tumor cell line of 
murine plasma or B-cells. Only in B-cells, the chromatin harboring the immunoglobulin 
loci is in its fully active, open state, allowing for high transcriptional activity of native 
immunoglobulin promoters or recombinant expression constructs integrated into those 
gene loci. 

As a disadvantage, due to the principle of homologous recombination, the targetting 
sequence will target efficiently in murine cell lines only matching the sequence of the 
ganuna 2 A targetting sequence harboring a recombinatorial hot spot; for high level 
expression, the gamma 2A locus region must be a transcriptionally active genomic region, 
limiting its effectiveness for homologous recombination to B-cell types. 

It is an object of the present invention to devise another expression system for CHO 
protein expression in biotechnology which allows for enhanced expression from a standard 
promoter. According to the present invention, this aim is surprisingly achieved by 
equipping a gene expression vector for CHO cells with a gene targetting sequence having 
been originally devised for homologous recombination in murine B-cells. 



Possible embodiments of the invention are shown in the figures. What is shown is: 

Fig. 1 Relative expression levels of green fluorescent protein (GFP) fi:om hCMV promoter 
and hCMV promoter in the presence of the IgG 2 A hot spot sequence m transient 
transfection of CHO-Kl cells 

Fig. 2 Relative GFP expression levels from hCMV promoter and hCMV promoter in the 
presence of the IgG 2A hot spot sequence in stably transfected CHO-Kl cells. 

Fig. 3 Plasmid map of hCMV-MDE expression vector carrying IgG 2A targetting sequence 

According to the present invention, a DNA sequence for expression of a recombinant gene 
in a mammalian cell comprises a recombinant product gene and a promoter for expressing 
the recombinant product gene, preferably a CMV promoter, and fiirther comprises a 
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murine immunoglobulin gamma 2A locus DNA sequence or fragments or sequence 
variants thereof capable of enhancing expression from the promoter. According to the 
present invention, such a DNA sequence is useful expression vector constmct for 
expression of recombinant product gene in CHO cells. 

According to the present invention, the method of expressing a recombinant protein 
comprises the steps of 



a. culturing a CHO cell transfected with an expression vector comprising a promoter 
10 active in CHO cells driving expression of a recombinant product protein and further 

comprising the murine IgG 2 A gene locus DNA or a DNA sequence variant or 
DNA fragment thereof which is enhancing activity of said promoter, and 

b. harvesting the product protein 

15 A recombinant product gene according to the present invention is the product protein that 
is sought to be expressed and harvested in high amoimt. It may be any protein of interest, 
e.g. therapeutic proteins such as interleukins or eniymes or subunits of multimeric proteins 
such as antibodies or fragments thereof. The recombinant product gene may include a 
signal sequence coding sequence portion allowing secretion of the once expressed 

20 polypeptide from the host producer cell. In a fiuther prefered embodiment of the present 
invention, the product protein is a secreted protein. More preferably, the first or product 
protein is an antibody or engineered antibody or a fragment thereof, most preferably it is an 
InMnimoglobulin G (IgG) antibody. 



The DNA sequence of the murine immunoglobulin gamma 2 A gene locus (IgG 2A) has 
originally been devised in WO 95/17516 for use as a genomic targetting sequence for 
generating stably recombinant lymphoid B-cell lines that show high expression of the 
recombinant gene product. B lymphocytes or plasma cells normally express extremely high 
30 levels of immunoglobulin RNA from the the Ig heavy chain locus, probably due to cell- 
type specific enhancer/transcription factor activity and open chromatin structure. The 
preferred murine immunoglobuline gamma 2 A gene sequence of the present invention is 
the same as the targetting sequence used in WO 95/17516. It is a 5,1 kb BamHI genomic 
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fragment which includes aU of the coding region of murine Ig ganuna 2A except the most 
5' part of the CHI exon (Yamawaki-Kataoka, Y. et al., Proc. Natl. Acad. Sci. U.S.A. 
(1982) 79: 2623-2627; Hall, B. et al.. Molecular Immunology (1989) 26:819-826; 
Yamawaki-Kataoka. Y. et al.. Nucleic Acid Research (1981) 9: 1365-1381). According to 
the present invention, promotion of site-directed, homologous recombination is not the 
relevant property of the immunoglobulin gamma 2A gene sequence (IgG 2A). 
Accordingly, any sequence variant of said IgG 2A gene sequence or sequence fragment or 
variant sequence fragment that is functional in or capable of enhancing recombinant 
product gene expression from the promoter, preferably from a hCMV promoter as set 
forth below, both under condition of transient or stable expression in CHO cells is also 
encompassed by the present invention. 

Such ,fimctional' variants encompass e.g. base insertions, deletions or point mutations and 
be generated by methods well-known in the art. e.g. by primer-directed PGR, 'error-prone' 
PGR, 'gene-shufifling' termed PGR-reassembly of overlapping DNA fragments or by in- 
vivo random mutagenesis of bacterial clones followed by library transfection and 
frmctional selection in CHO cells. For instance, random mutagenesis can be achieved by 
alkylating chemicals or UV-irradiations as described in Miller, J., Experiments in 
Molecular Genetics, Cold Spring Harbor Laboratory 1972). OptionaUy, a natural mutator- 
strain of a host bacterium may be used. 

Preferably, such variant sequence or sequence fragment is at least 65%, more preferably 
75%, most preferably 90% homologous in DNA sequence to the corresponding part of the 
natural murine immunoglobuline gamma 2 A gene locus. For instance, it is possible to 
insert a Sal I restriction site at the nattirally occurring Stu I site present 39 bp upstream of 
membrane exon 2 (M2) to provide a unique site for linearization within the murine 
immunoglobulin gamma 2A sequence; such sequence variant was originally devised for 
site-specific recombination targetting, but can as well be employed in the context of the 
present invention. 

A 'promoter' is defined as a DNA sequence that directs RNA polymerase to bind to DNA 
and intiates RNA synthesis. According to tiie present invention, it is a promoter that is 
active in CHO cells. Such a promoter preferably is a strong promoter. A strong promoter is 
one which causes mRNAs to be initated at high firequency equal to or higher than tiiat of 
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hCMV core promoter/enhancer fragment (described in US5168062) in CHO-KI cells. Such 
promoter may be a cell-type dependent strong promoter, as are cited in US5589392, or 
preferably is a ubiquitously active strong promoter, more preferably a constitutively active 
viral promoter such as e.g. early and late promoters of the SV40 virus, the immediate early 
promoter of the human cytomegalovirus (hCMV) or of murine cytomegalovirus (mCMV), 
the thymidine kinase promoter (TK) of Herpes Simplex virus or the Rous Sarcoma Virus 
long terminal repeat promoter (RS-LTR), more preferably it is the hCMV-MIE promoter 
as defined by the 2.1 kb Pst I fragment described in US 5,385,839 and/or EP-323 997-Al 
or a fimctional part thereof having promoter activity. The hCMV promoter construct 
harboring the complete first fimctional intron of the major immediate early (MIE) gene of 
hCMV, as set forth in EP-323 997-Al , is a particularly prefened embodiment of the 
present invention. 

Preferably a hCMV promoter employed m the present invention lacks the 'modulator- 
sequence part in the upstream/enhancer portion of the promoter. The ' modulator' 
sequence has been found to be detrimental to hCMV promoter activity in CHO cells and 
stretches from position -750 to position -1150 relative to the MIE transcription start site 
(Meier et al ., 1996, Interviiology 39: 331-342, Regulation of hCMV immediate-early 
gene expression), in particular in transient transfection. Without the modulator sequence, 
the enhancing effect of the presence of the IgG 2A host spot sequence on (modulator 
negative or mod- for short) hCMV promoter is even more pronounced. 

A transient transfection is characterised by non-appliance of any selection pressure for a 
vector borne selection marker. A pool or batch of cells originating from a transient 
transfection is a pooled cell population that comprises cells which have taken up and do 
express and cells that have not taken up the foreign DNA. Cells that express the foreign 
expression cassette do usuaUy not have integrated the transfected DNA into their genome 
yet and tend to lose the foreign DNA and to overgrow transfected cells in the population 
upon culture of the transiently transfected ceU pool. Therefore expression is strongest in 
the period inmiediately following transfection and decreases with time. Preferably, a 
transient transfectant according to the present invention is understood as a cell that is 
maintained in cell culture in the absence of selection pressure up to a time of 90 hours post 
transfection. 
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Preferably, a transfected CHO host ceU according to the present invention is a stably 
transfected host cell, in particular in combination with a hCMV promoter as set forth 
above. Stable transfection means that newly introduced foreign DNA is becoming 
incorporated into genomic DNA. usually by random, non-homologous recombination 
events; in case of a vector sequence, stable transfection according to the present invention 
may result in loss of vector sequence parts not directly related to expression of the 
recombinant product gene, such as e.g. bacterial copy number control regions rendered 
superfluous upon genomic integration. A transfected host cell has integrated at least part or 
different parts of the expression vector into the genome. Likewise, transfection of CHO 
ceUs with two or several DNA fragments giving rise at least in vivo to functional 
equivalents of the essential elements of the expression vector of the invention, namely the 
product gene under control of a suitable promoter and the hot spot IgG 2A sequence, is 
contained in the defmition of such transfected host cells, hi vivo assembly of functional 
DNA sequences after transfection of fragmented DNA is described e.g. in WO 99/53046. It 
is possible that such stable integration gives rise, upon exposure to further selection 
pressure for gene amplification, to double minute chromosomes in CHO cells. This is 
comprised in the present meaning of 'stable'. Upon random genomic integration of the 
expression vector of the present invention in CHO, the presence of the targetting sequence 
enhances promoter, activity for expression of the recombinant product protein. Such effect 
has not been observed nor could it have been anticipated upon homologous gene targetting 
in mature murine B-cell lines including plasmacytoma/myeloma cell lines; there, the IgG 
2 A targetting sequence served solely to increase the frequency of high-yielding 
homologous integrants smce the IgG 2 A locus proved to be a recombinatorial 'hot spot'. 
As said before, the chromatin of the immunglobulme genomic region is m an open, highly 
active state in suitably targetted B-cell lines. 

'Expression vectors' are defined herem as DNA sequences that are required for 
transcription and the translation of then: mRNAs in an appropriate mammaUan host cell 
line after transfection with vector. An appropriately constructed expression vector should 
usually contain: at least one expressable marker selectable in anunal cells, a limited 
number of usefiil restriction sites for insertion of the expression cassette for the 
recombmant product gene under control of an upstream promoter region. Where used in 
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particular for transient/episomal expression only, it may further comprise an origin of 
repUcation such as origin of Eppstein Barr Virus (EBV) or SV40 virus for autonomous 
repUcation/episomal maintenance in eukaryotic host ceUs but may be devoid of a selectable 
marker. Expression vectors are e.g.. but are not limited to. linear DNA fragments, DNA 
fragments encompassing nuclear targetting sequences or are speciaUy optimized for 
interaction with transfection reagents, animal viruses or suitable plasmids that can be 
shuttled and produced in bacteria- Any selection marker commonly employed such as 
thymidine kinase (tk), dihydrofolate reductase (DHFR) or glutamine synthetase (GS) may 
be used. In a preferred embodiment, an expressable GS selection marker is employed 
(Bebbington et al., 1992, High-level expression of a recombinant antibody from myeloma 
cells using a glutamine syntiietase gene as an amplifiable selectable marker, 
Bio/Technology 10:169-175; Cockett et al., 1990, High level expression of tissue inhibitor 
of metalloproteinases in Chinese Hamster Ovary (CHO) cells using Glutamine synthetase 
gene amplification. Bio/Technology 8: 662-667). - The GS-system is one of only two 
systems that are of particular importance for the production of therapeutic proteins. In 
comparison to the dihydrofolate reductase (DHFR) system, the GS system offers a large 
time advantage during development because highly productive cell lines can often be 
created from the initial tranfectant thus avoiding the need for multiple rounds of selection 
in the presence of increasing concentrations of selective agent in order to achieve gene 
amplification (Brown et al., 1992, Process development for the production of recombmant 
antibodies using the glutamine synthetase (GS) system, Cytotechnology 9:231-236). It 
goes without saying that equivalent to a second transcription unit for expression of the 
marker gene, an expression unit could use a monocistronic expression cassette both for the 
product gene and the marker gene by employing e.g. intemal ribosome entry sites as is 
routinely employed m the art Vice versa, it goes without saying that the hot spot IgG 2 A 
sequence of the present invention and the expression cassette for the product protein 
comprising a promoter and/or marker cassette are not required to work in cis on a single 
expression vector; the elements can be well carried on separate co-transfected vectors or 
DNA fragments which may then be chromosomally integrated at a single, concatemeric 
integration site. 



A further object of the present invention are CHO host cells transfected with the DNA 
sequences of the present invention. Further objects are a method for transfection of su< 
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host cells and a method for expression of the recombinant product gene in such host ceUs. 
The explanations and references made to preferred embodiments in the present 
specification of the invention relate likewise to all these fiirther objects of the present 
invention. It is to be noted that a host cell transfected with the DNA sequence or vector of 
the present invention is to be construed as being a transiently or stably transfected ceU line. 
Any transfection technique such as those well-known in the art, e.g. electoporation, Ca- 
phosphate precipitation, DEAE-dextrane transfection, lipofection, can be employed 
according to the present invention if appropriate for a given host cell type. 

A suitable host cell line can be any Chinese hamster ovary (CHO) cell line (Puck et al., 
1958, J. Exp. Med. 108: 945-955). The term 'host cell' refers to cells capable of growth in 
cultiire and expressing a desired protein recombinant product protein. Suitable cell lines 
can be e.g. CHO Kl (ATCC CCL-61), CHO pro3-, CHO DG44, CHO P12 or the dhfi:- 
CHO ceU line DUK-BH (Chassin et al., PNAS 77, 1980, 4216-4220) or DUXBll 
(Simonsen et al., PNAS 80, 1983, 2495-2499). In CHO cells, the immunoglobuline gene 
loci are inactive and the chromatin is therefore in a densely packaged or closed state. Thus, 
any gene consfa^ct integrated in the immunoglobuline loci could not give rise to high-level 
expression of recombinant protein due to the specific state of chromatin, unless it would 
itself comprise flanking locus control regions promoting opening of the chromatin on both 
sides of the expression cassette. Further, immunoglobuline gene sequence, and in 
particular the intron portions of it, show considerably divergence amongst species, e.g. 
ftom mouse to hamster. The promoter or enhancer elements of immunoglobline loci are 
both species and tissue specific and should be active in B-cells only. The murine IgG 2A 
sequence of the present invention enhances gene expression in CHO cells also in the 
absence of any natural immunoglobuline promoter that is giving rise to fiiU-lengtii 
transcripts coding for complete IgG heavy chain. Preferably, the IgG 2A sequence of the 
present invention is devoid of such promoter. Surprisingly, the murine IgG 2 A targetting 
sequence even improved gene expression in CHO cells upon transient ti^sfection of CHO 
cells with expression vectors according to the present invention (Fig. 1); such tiansient 
expression is a further preferred embodiment of a method according to the present 
invention. In transient expression assays which are commonly taking place about 20-50 
hours post transfection, the ti^sfected vectors are maintained as episomal elements and 
are not yet integrated into the genome. 
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Suitable media and culture methods for mammalian cell lines are well-known in the art, as 
described in US 5633 1 62 for instance. Examples of standard ceU culture media for 
laboratory flask or low density cell culture and being adapted to the needs of particular cell 
types are for instance: RosweU Park Memorial Institute (RPMI) 1640 medium (Moire, G., 
The Journal of the American Medical Association, 199, p.519 f. 1967), L-15 medium 
(Leibovitz, A. et al., Amer. J. of Hygiene, 78. Ip.l73 1963), Dulbecco's modified 
Eagle's medium (DMEM), Eagle's minimal essential medium (MEM), Ham's F12 
medium (Ham, R. et al., Proc. Natl. Acad. Sc.53, p288 ff. 1965) or Iscoves' modified 
DMEM lacking albumin, transferrin and lecithin (Iscoves et al., J. Exp. med. 1, p. 923 £f., 
1978). For instance. Ham's FIO or F12 media were specially designed for CHO cell 
culture. Other media specially adapted to CHO cell culture are described in EP-481 791. It 
is known that such culture media can be supplemented with fetal bovine serum (FBS, also 
called fetal calf serum FCS), the latter providing a natural source of a plethora of hormones 
and growth fectors. The cell culture of mammalian cells is nowadays a routine operation 
well-described in scientific textbooks and manuals, it is covered in detail e.g. in R. Ian 
Fresney, Culture of Animal cells, a manual, 4* edition. Wey-Liss/N.Y., 2000. 

Preferably, the cell culture medium according to the present invention is devoid of fetal 
calf serum (FCS or FBS), which then is being termed *serum-&ee'. Cells in serum-firee 
medium generally require insulin and transferrin in a serum-fi«e medium for optimal 
growth. Transferrin may at least partially be substituted by non-peptide chelating agents or 
siderophores such as tropolone as described in WO 94/02592 or increased levels of a 
source of anorganic iron favorably in conjunction with antioxidants such as vitamin C. 
Most cell Unes require one or more of synthetic growth factors (comprising recombinant 
polypeptides), including e.g. epidermal growth factor (EOF), fibroblast growth factor 
(FGF), insulin Uke growth factors I and U aOFI, IGFH), etc.. Other classes of factors 
which may be necessary mclude: prostaglandins, transport and binding proteins (e.g. 
ceruloplasmin, high and low density Upoprotems, bovine serum albumin (BS A)), 
hormones, includmg steroid-hormones, and fatty acids. Polypeptide factor testing is best 
done in a stepwise fashion testing new polypeptide factors in the presence of those found 
to be growth stimulatory. Those growth factors are synthetic or recombinant. There a 
several methodological approaches well-known in animal cell culture, an exemplary being 



wo 2004/009823 




PCT/EP2003/007946 



10 

described in the following. The initial step is to obtain conditions where the cells will 
survive and/or grow slowly for 3-6 days after transfer from serum-supplemented culture 
medium. In most cell types, this is at least in part a function of inoculum density. Once the 
optimal hormone/growth factor/polypeptide supplement is found, flie inoculum density 
5 required for survival will decrease. In a more preferred embodiment, the cell culture 

medium is protein-free, that is free both of fetal serum and individual protein growth factor 
supplements or other protein such as recombinant transferrin. 

A possible embodhnent of one method of the present invention, namely expression and 

10 harvest of the recombinant product protein, is high-density grovs^ of the animal host cells 
e.g. in an industrial fed-batch bioreactor. Conventional downstream processing may then 
be applied. Consequently, a high-density growth culture medium has to be employed. Such 
high-density growth media can usually be supplemented with nutrients such as all amino 
acids, energy sources such as glucose in the range given above, inorganic salts, vitamins, 

15 trace elements (defined as inorganic compounds usually present at final concentrations in 
the micromolar range), buffers, the four nucleosides or their corresponding nucleotides, 
antioxidants such as Glutathione (reduced), Vitamine C and other components such as 
important membrane lipids, e.g. cholesterol or phosphatidylcholine or lipid precursors, e.g. 
choline or inositol. A high-density medium will be em^iched in most or all of these 

20 compoimds, and will, except for the inorganic salts based on which the osmolarity of the 
essentially isotonic mediimi is regulated, comprise them in higher amoxmts (fortified) than 
the afore mentioned standard media as can be incurred from GB2251 249 in comparison 
with RPMI 1640. Preferably, a laigh-density culture medium according to the present 
invention is balancedly fortified in that all amino acids except for Tryptophane are in 

25 excess of 75 mg/1 culture medium. Preferably, in conjunction with the general amino acid 
requirement, Glutamine and/or Asparagine are in excess of 1 g/1, more preferably of 2 g/1 
of high-density culture medixim. In the context of the present invention, high-density cell 
culture is defined as a population of animal cells having temporarily a density of viable 
cells of at least or in excess of 10^ cells/ml, preferably of at least or in excess of 10^ 

30 cells/ml, and which population has been continously grown from a single cell or inoculum 
of lower viable cell density in a cell culture medium in a constant or increasing culture 
volume. 
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In a further prefered embodiment, the fed-batch culture is a culture system wherein at least 
Glutamine, optionally with one or several other amino acids, preferably glycine, is fed to 
the cell culture as described in GB225 1249 for maintaing their concentration in the 
medium, apart from controlling glucose concentration by spearate feed. More preferably, 
the feed of glutamine and optionaUy one or several other amino acids is combined with 
feeding one or more energy sources such as glucose to the ceU culture as described in EP- 
229 809-A. Feed is usually initiated at 25-60 hours after start of the culture; for instance, it 
is useful to start feed when cells have reached a density of about 10*^ cells/ml. It is well 
known in the art that in cultured animal cells, 'glutaminolysis' (McKeehan et al., 1984, 
Glutaminolysis in animal cells,in: Carbohydrate Metabolism in Cultured Cells, ed. MJ. 
Morgan, Plenum Press, New York, pp. 1 1-150) may become an important source of energy 
during growth phase. The total glutamine and/or asparagine feed (for substitution of 
glutamine by asparagine, see Kurano, N. et al., 1990, J. Biotechnology 15, 1 13-128) is 
usually in the range from 0.5 to 10 g per 1, preferably from 1 to 2 g per 1 culture volume; 
other amino acids that can be present in the feed are from 10 to 300 mg total feed per Utre 
of culture, m particular glycine, lysine, arginine, valine, isoleucine and leucine are usually 
fed at higher amounts of at least 150 to 200 mg as compared to the other amino acids. The 
feed can be added as shot-addition or as contionusly pumped feed, preferably tiie feed is 
almost continously pumped into the bioreactor. It goes without saying that the pH is 
carefully controlled during fed-batch cultivation in a bioreactor at an approximately 
physiological pH optimal for a given cell line by addition of base or buffer. When glucose 
is used as an energy source the total glucose feed is usually from 1 to 10, preferably from 3 
to 6 grams per litre of the culture. Apart from inclusion of amino acids, the feed preferably 
comprises a low amount of choline in the range of 5 to 20 mg per litre of culture. More 
preferably, such feed of choline is combined with supplementation of ethanolamine 
essentially as described in US 6048728, in particular in combination with feeding 
glutamine. It goes without saying that upon use of the OS-marker system, lower amounts • 
of glutamine will be required as compared to a non-GS expression system since 
accumulation of excessive glutamine in addition to the endogenously produced would give 
rise to ammonia production and concomittant toxicity. For GS, glutamine in the medium or 
feed is mostly substituted by its equivalents and/or precursors, that is asparagme and/or 
glutamate. 
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It is a further, independent object of the present invention to devise an expression vector 
comprising at least a (first) transcription unit for a product gene, giving rise to product 
protein upon expression in a host cell, and which transcription unit is under the control of 
the mouse Cytomegalovirus promoter (mCMV promoter), and further comprising a second 
transcription unit comprising a glutamine synthetase (GS) marker gene. Such a product 
gene, or gene of interest (GOI) as it may be termed, can be e.g. an unmunoglobulin coding 
sequence. A glutamine synthetase marker gene is any enzymatically active GS coding 
sequence, be it a natural gene sequence or a variant thereof. The above definitions of 
'functional variant' as set forth above apply here as weU mcluding the preferred ranges of 
sequence homology. Preferably, the GS marker gene is a mammalian GS marker gene or 
derived thereof. Surprisingly, such expression vector aUows for much higher transfection 
rates upon transfection in CHO cells than does e.g. an expression vector in vMch the first 
transcription unit harboring the gene of interest is under control of the hCMV promoter. 
This despite the fact that m CHO cells, transcriptional activity of the mCMV promoter is 
much higher than that of hCMV promoter, usually it is believed that upon transfection, 
higher metabolic load reduces clonal survival upon transfection, resulting in lower 
numbers of transfectants . Thus the effect can not be correlated in an obvious manner with 
the amount or unexpected toxicity of product protein expressed, the latter possibly 
adversely affectmg growth of transfectants. Indeed, the finding is the very opposite of any 
expectation of a skilled person. 

Further objects according to the present invention are animal host cells, in particular CHO 
cells, transfected with such an expression vector which vector can be maintained 
episomally or can be stably integrated in the genome and a respective transfection method. 
Likewise, transfection of animal ceUs, in particular CHO cells, with two or more gene 
Augments giving rise in-vivo to functional equivalents of the transcription units of the 
present object of the invention, is withm the definition of such transfected host cells. 
Preferably, said host cells are stably transfected cells, meaning that the first and second 
transcription unit are chromosomally integrated. 

A further object is the use of mCMV promoter to enhance transfection rate in CHO cells, 
preferably when using an expression vector comprising at least a first transcription unit for 
a product gene which first unit is giving rise to product protein upon expression in a host 
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cell and which first transcription unit is further under the control of the mouse 
Cytomegalovirus promoter (mCMV promoter), and further comprising a second 
transcription unit comprising a glutamine synthetase (GS) marker gene. It may also be 
possible to transfect the first and second expression borne on different vectors, or as 

5 isolated gene firagments harboring individual expression units. Further, it may be possible 
to transfect a CHO cell that is already recombinant for and expresses GS with a first 
transcription unit harboring mCMV. According to the present invention, 'enhancing 
transfection rate' is defining by comparing transfection rate in the presence of the mCMV 
promoter and expression vector according to the present invention with the transfection 

10 rate of the same expression vector and host cell under identical transfection and cell culture 
conditions except that in the expression vector, the mCMV promoter is substituted to the 
hCMV-first intron enhancer/promoter constmct as defined in US 5658 759 and as set forth 
e.g. in sequence ID. No. 3 of the present invention. This hCMV-intron MIE-promoter 
construct, for a given identical product gene, serves as a standard for detennining the 

15 claimed effect of enhanced transfection rates. Preferably, use of mCMV promoter results in 
at least 10-times enhanced transfection rate. 

All relevant definitions given further above apply likewise to the present, independent 
objects of the invention. It must be stressed that the present object of the invention does not 
20 require the presence of the murine IgG 2 A targetting sequence as a prerequisite. 

Mmine cytomegalovirus (mCMV) is a member of the highly diverse group of 
herpesviridae. Even amongst cytomegaloviruses of different host species there can be wide 
variation. For example, mCNIV differs considerably from the human cytomegalovirus 

25 (hCMV) with respect to biological properties, immediate early (EE) gene organization, and 
overall nucleotide sequence. The 235-kbp genome of mCMV also lacks large internal and 
terminal repeat characteristics of hCMV. Accordingly, no isomeric forms of the MCMV 
genome exist (Ebeling, A, et al., (1983), J. Virol. 47, 421-433; Mercer, J. A. et al., (1983), 
Virology 129, 94-106). According to the present invention, it is possible to employ the 

30 promoter region essentially corresponding to a large approx. 2. 1 kb PstI fragment 
described in US 4968615 or any fimctional fragment thereof. In a more preferred 
embodiment, the mCMV promoter fragment employed comprises the transcription start 
site (+0) and extends upstream to about position -500. Surprisingly, such fragment has 
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been found to promote stronger expression than a promoter cassette extending 800 bp 
fiirther upstream beyond position -500. In a most preferred embodiment, a core promoter 
region is employed that extends from the transcription start site upstream but to the Xho I 
restriction site at about position -150 from the natural transcription start site or even 
5 extending but to position —100 upstream from the natural transcriptions start site. It goes 
without saying that the transcription start site might be engineered in order to comprise a 
suitable restriction site for insertion of the recombinant product gene. 

According to the present invention, it is also possible that the first transcription unit that is 
10 under control of the mCMV promoter harbors at least one intron sequence. Such measure 
is well-known in the art for stabilising RNA transcripts and for promoting efficient protein 
synthesis from the corresponding mKNA. For efficient protein synthesis without having 
regard to the claimed effect on transfection rate, it is however not advisable to include the 
first, natural intron of mCMV in the mCMV promoter construct. In contrast to the situation 
15 with hCMV promoter (cf. US 5591639), such natural first intron of mCMV was foimd to 
decrease expression of a recombinant gene from the mCMV promoter and is therefore 
excluded in a further preferred embodiment. 

20 Examples of preferred, possible embodiments of GS marker gene cassettes are given in the 
sequence listings. Seq IDs No. 1 (pEE 15.1 hCMV/GFP + hot spot) +2 (pEE 14.4 
hCMV/GFP) give examples of suitable GS-gene cassettes that are expressed from the 
SV40 (early and late, respectively) promoter, a weak to medium level promoter, further 
comprising an expression cassette for GFP (Green fluorescent protein) that is under control 

25 of the hCNfV promoter. Seq. ID No. 1 describes a GS cDNA sequence described in more 
detail in the figure legend of Fig. 3, under control of the SV40 early promoter. Seq. ID No. 
2 specifies an artifical GS-minigene cassette comprising an intron that is under control of 
the SV40 late promoter. CHO cells are not naturally glutamine auxothropic, therefore 
selection schemes as e.g. described in Cockett et al., 1990, High level expression of tissue 

30 inhibitor of metalloproteinases in Chinese Hamster Ovary (CHO) cells using Glutamine 
synthetase gene amplification, Bio/Technology 8: 662-667, can be applied. Examples of 
suitable transfection methods for CHO cells are equally given therein; it is possible to 
employ e.g. classic calcium phosphate precipitation or more modem lipofection 
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techniques. Transfection rate is routinely defined as the number of positively transfected 
ceUs (transient transfection) or clones (stable transfection after selection period) obtained 
from a pool of cells subjected to transfection. The purported effect of the present object of 
invention can be seen e,g. by transfecting CHO-Kl cells by lipofection (any commercial 

5 reagent and manufacturers protocol) with the plasmids of either Seq. ID No. 3 (pEE 12.4 
hCMV-GFP + SV40 early promoter/GS cDNA) or Seq. ID No. 4 (pEE 12.4 mCMV-GFP 
+ SV40 early promoter/GS cDNA). Transfected cells may be grown in any conventional 
culture medium. The culture mediimi may be a fetal serum-supplemented or serum-free 
mediimi as has been defined above. Preferably, the cell culture medium is a serum- 

10 supplemented medium, more preferably a cell culture medium that has been supplemented 
with at least 1% (v/v) fetal serum, most preferably with at least 5% (v/v) fetal serum such 
as fetal calf serum or fetal bovine serum. In another preferred embodiment, the transfection 
method carried out is electroporatiou. 



15 
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Experiments 

Experiment 1 

Transient and stable expression of GFP vector com prising hot soot sequence in CHO-Kl 
cells 

CHO-Kl cells (ATCC CCL-61) were adapted and cxiltured in normal cell culture medium 
GMEM -S (Gibco, UK) with 10% FCS. - For OS selection, the medium must be 
completely free of glutamine as set forth in table 1 below; this necessitates use of dialysed 
FCS. - All culturing was carried out in shake flask at 36.5 *'C with orbital shaking at 125 
rpm. Lipofectin (Superfectin™ ,Gibco, UK) was used for transfection and green 
fluorescence of transfectant pool was measured in a FACS with excitation at 488 nm. For 
every GS/GFP vector construct, transfection was carried out ind^ndently five limes, all 
data being the average from five independently analyzed pools. Starting with transient 
transfectants 48 h post-transfection, the top scoring 1 0% highly expressing ceUs of the 
viable cell pool in the cell count vs. fluorescence diagram were selected to determine mean 
fluorescence (Fig.l). Viable cell population has been preselected by gating in the Forward 
vs. sideward scatter diagram. 

For generating stable transfectants, GS marker was selected 24 hours post-transfection by 
supplementing the glutamine-fi^e medium with 25 \iM MSX (methionine sulphoximine, 
Crockett et al., ibd.) and continuing ceU culture with regular spUtting of cultures for 26 
days. Note tiie impact of mediimi levels of other amino acids on the potency of MSX for 
selection, see Bebbington et al., US 5827 739. Flurorescence analysis then performed 
again as outlined above (Fig. 2). 

Untransfected cells served as negative control. The hot spot vector (pEE 15.1 'hCMV + hoi 
spot') driving expression of GFP under control of the hCMV promoter comprising the first 
complete intron of CMV is given in Seq. ID No. 1 and essentially is the pEE 15.1 vector 
shown in Fig. 3 into which the GFP sequence was inserted into the Eco RI restriction site 
in the polylinker. pEE 12.4 'hCMV' corresponding to Seq. ED No. 3 is identical to pEE 
15.1 'hCMV + hot spot' except that it does not comprise the 5.1 kb Bam HI fragment 
harboring the IgG 2A sequence. pEE 12.4 served as a vector control. A fiirther vector 
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control pEE 12.4 'hCMV(Kozak-)' was generated by mutating the Kozak sequence of the 
cloning site conmciding with the translation start site (GCCGCCA CCATGG) to a 
frameshifted fiinctional Kozak sequence that UCC ATGG T CCATGG) by primer directed 
mutagensis (Sambrook et al.. Molecular cloning. Cold Spring Harbor 1983), attentuating 
the original Kozak and translation start site. The vector of Seq- ID No. 1 was further 
engineered to delete the 400 bp modulator region of hCMV enhancer portion, deleting the 
enhancer elements upstream of -750 from the transcription start site, giving rise to pEE 
15,1 'hCMV(mod-)/GS cDNA'. By exchange of the GScDNA cassette of pEE 15,1 (s. Fig. 
3) with the GS minigene of pEE 14.4 'hCMV(mod-)' /GFP, corresponding to Seq. ID No. 
2, the vector pEE 15.1 'hCMV(mod-)/GS minigene' was created. Thus all transfected cells 
harbored a plasmid vector comprising the GFP coding sequence. The GS minigene 
contains a single, first mtron of the GS gene and about 1 kb of 3' flanking DNA under the 
control of the SV40 late promoter; the 3' part of the genomic GS DNA is believed to cause 
a higher copy-number of vector DNA and thus of GS in transfected cells (see US4770359, 
Bebbington et al.). Whereas all hCMV vectors employed in the present study express the 
GS maker gene from its cDNA sequence, use of the GS minigene was included as a fiirther 
control in order to exclude potential effects of GS copy number and expression level. 

For generation and expression analysis of stably transfected CHO cells, transfections were 
performed with Imearized hot spot vector pEE 15.1 'hCMV+ hot spot' vector. Sal I 
linearized plasmid was cut in the IgG 2 A comprising sequence portion, free DNA ends 
potentially stimulatmg recombination with genomic regions sharing a certain degree of 
homology with the flanking DNA portions, testing for potential targetting effects of murine 
IgG 2A in hamster CHO cells. Pvu I cut in the bacterial lactamase marker gene and 
therefore could promote but heterologous random recombination. Indeed, the mean 
fluorescence was higher in the Pvu I linearized transfectants showing both some influence 
of vector linearization as well as that targetting to immunoglobuline loci in CHO cells may 
not account for the effect of the present uivention. In addition, the effect of enhanced 
promoter activity was consistingly observed in transiently transfected cell populations, 
nicely correlating with relative strengtii of individual vector constructs. Clearly, genomic 
integration is not involved at this early stage of transfection. 



Fig. 3 shows vector pEE 15,1 of approximately 12 830 bp. A detailed description of the GS 
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marker and the hCMV-p/intron expression cassette can be found in US5827739 and 
US5591639. pEE 15. Lis a possible embodiment of an compression vector according to the 
present invention, except that the DNA sequence coding for the recombinant product 
protein has not yet been inserted in the polylinker site. The complete 13535 bp sequence of 

5 the pEElS.l construct harboring GFP is given in Seq. ID No. 1: Therein, the GFP coding 
sequence was inserted in-frame in the Eco R I restriction site centered at base position 12 
814; the introduction of the unique restriction site harboring the ATG start codon and 
optimizing the Kozak sequence environment of the start codon is described in detail in US 
5591639. Thus, the expression of GFP protein is under control of the hCMV-major 

10 immediate early gene promoter (hCMV-MIE or hCMV for short) immediately followed by 
the first intron of hCMV-MIE gene followed by the Nco I site (s. US 5591 639). 
Polyadenlyation is ensured by the SV40 poly A site further downstream of the polylinker 
insertion site. pEE 15.1 further harbors a cDNA sequence coding for glutamine synthetase 
(GS) from hamster that is under control of the SV 40 early promoter and is followed by an 

15 SV40 intron +poly A sequence. The IgG 2A gene locus or 'hot spot' sequence (hatched 
boxes CHI, Hi, CH2, CH3, Ml, M2 standing for Heavy chain constant region, hinge, 
membrane anchor) is the 5.1 kb BamH I fragment of the murine IgG 2 A locus already 
described in WO 95 175 16 and the references cited therein. Unique restriction sites Pvu I 
and Sal I are shown. 

20 

Experiment 2 

Electroporation of CHO cells with mCMV pi 2.4 -GFP constmct (Seq. ID No.4^ 

Attached CHO-Kl cells (ATCC CCL-61) were cultured in Iscoves' DMEM medium 
25 essentially as described in EP-481 791 comprising 2 mM Glutamine which was further 
supplemented with 10% FCS. Optionally, the G-MEM medium stated in table 1 and 
further comprising 2 mM Glutamine could be used prior to GS marker selection as in 
experiment 1. The cells were detached, pelleted and resuspended twice in sermn-free 
medium, finally at a density of 5.3 xlO^ cells/ml. Per 750 ^il electroporation batch, a total 
30 of 4x10^ cells was electroporated. Electroporation was carried out as described in Methods 
in Molecular Biology, ed. JANickoloff ed, Humana Press 1995, Vol. 48/Chap. 8: Animal 
cell electroporation and electrofusion protocols. pl2.4 mCMV-GFP vector DNA (sequence 
ID No. 4) was linearized. 50^1 (20fig) DNA were added to 750|il cells in electroporation 
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cuvette and electroporate - 300 Volts / 750 fxFd - expecting an electorporation time of 
around 12 - 14 msec. Following electroporadon 800|li1 volume of cells was transferred into 
25 ml of modified Glasgow-MEM (GMEM, Gibco) culture medium for OS selection 
(comprising 10% fetal serum but no glutamine, for details see table 1 ) in a T75 flask. 
5 Divide into 2 x T75 flasks by moving 12,9 mis into a second flask and incubate overnight 
at37^Cinl0%CO2 

On the next day 37.5ml of OS-selection GMEM culture medium supplemented with 10% 
FBS + 333pM MSX (methionine sulphoximine) were added. Thus MSX was jBnally 
10 -25^M. Transfectants were counted after further incubation for 26 days by colony count 
per flask. Upon microscopic inspection in a standard inverted microscope for inspection of 
culture flasks, positive colonies brightly lit up in light green and could be easily counted. 
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The mCMV constmct of Seq. ID No. 4 yielded up to 20 times more foci than did cells that 
15 were transfected in parallel with the hCMV construct of Seq. ID No. 3. The vector 
constructs only differed in the CMV promoter elements driving OFP expression, the 
remaining vector parts of the vectors were identical (includii^ OS-marker; cDNA GS- 
marker cassette of pl2.4) . If cells were diluted out into 96 well plates immediately 
following transfection, many more colonies come up from mCMV transfected cells (>400 
20 colonies) than from hCMV transfected cells (about 45 colonies). 



TABLE 1: Medium for OS selection 



A. Stock Solutions 
25 1 . Double distilled water autoclaved in 400 ml aliquots 

2. 10 X Glasgow MEM (GMEM) without glutamine (GIBCO: 042-2541 in UK). Store at 4*'C. 

3 . 7.5% sodium bicarbonate (GIBCO: 043-05080 in UK; 670-5080 in US). Store at 4**C. 



4. 100 X non-essential amino acids (NEAA) (GIBCO: 043-01 140 in UK; 320-1 140 in US). Store at 
30 4**C. 

5. 100 X Glutamate + Asparagine (G A): add 600 mg glutamic acid and 600 mg asparagines (Sigma). 
Make up to 100 mi in distilled water and sterilize by passing through a sterile 2\iTn filter (Nalgene). 
Store at 4*'C. 



35 



6. lOOmM sodium pyruvate (GIBCO: 043-01360 in UK; 320-1360 in US) 
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7. 50 X nucleosides: 35 mg adenosine 

35 mg guanosine 
35 mg cytidine 
35 mg uridine 
12 mg thymidine 

(each from Sigma). Make up to 100ml with water, filter sterilise and store at -20**C in 10ml aliquots. 

8. Dialysed FCS (GIBCO: 014-06300). Heat inactivate at 56°C for 30 min and store at -20°C. It is 
essential to use dialysed FCS when using GS selection. 

9. Penicilin-streptomycin at 5000 units/ml (P/S: GIBCO: 043-05070 in UK; 600-5070 in US). 

10. 100 mM L.MSX (Sigma): prepare 1 8 mg/ml solution in PBS. Filter sterilise and store at -20°C. 
B.Medium Preparation 

Add the following in the order given using aseptic technique to make GMEM-S medium 



1. Water 


400 ml 


2. 10 X GMEM 


50 ml 


3. Sodium bicarbonate 


18,1 ml 


4. NEAA 


5 ml 


5. G + A 


5ml 


6. Sodium pyruvate 


5mi 


7, Nucleosides 


10 ml 


8. Dialysed FCS 


50 ml 


9. Penicillin-streptomycin 


5 ml 



GMEM-S contains the non-essential amino acids, alanine, aspartate, glycine, proline and serine (100 ^M), 
glutamate and asparagines (500 jiM), and adenosine, guanosine, cytidine and uridine (30 nM), and thymidine 
(10 ]sM). 
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Claims 

1 . CHO cell transfected with an expression vector comprising a promoter tbat is active in 
CHO cells and that is driving expression of a recombinant product protein and further 
comprising a portion from the murine IgG 2 A gene locus DNA which portion is 
enhancing activity of said promoter. 

2. CHO cell according to claim 1, characterized in that tiie vector further comprises a 
transcription unit encoding a selectable marker, preferably a glutamin synthetase (GS) 
marker. 

3. CHO cell according to claim 1 or 2, characterized in the CHO cell is stably transfected. 

4. Method of expressing a recombinant protein, comprising the steps of 

c. culturing a CHO cell transfected with an expression vector comprising a promoter 
active in CHO cells driving expression of a recombinant product protein and fiirther 
comprising the murine IgG 2 A gene loctis DNA or a DNA sequence variant or 
DNA fragment thereof which is enhancing activity of said promoter, and 

(L harvesting the product protein 

5. Method according to claim 4, characterised in that the promoter is a strong viral 
promoter, preferably the hCMV promoter. 

6. Method according to one of claims 4 or 5, characterised in that the IgG 2A gene locus 
portion does lack the natural immunoglobulin promoter. 

7. Mammalian expression vector comprising at least a first transcription unit for a product 
gene Avhich transcription imit is xmder the control of the mCMV promoter, and further 
comprising a second transcription unit comprising a glutamine synthetase (GS) marker 
gene. 

8. CHO cell transfected with the vector of claim 7. 
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Fig. 3 
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LBP30AND31.ST25.txt 
SEQUENCE LISTING 



<110> Lonza Biologies pic. 

<120> Method of expressing recombinant protein in CHO cells 
<130> LBP30+31 

<140> NA 25/01 
<141> 2002-07-. 

<160> 4 

<170> Patentin version 3.1 

<210> 1 

<211> 6679 

<212> DNA 

<213> Hamster sp. 

<220> 

<221> misc_feature 

<223> seq. ID. No. 4: circular plasmid GS vector pl2.4 short mCMV-GFP / 
clone 3 

gaattcattg atcataatca gccataccac atttgtagag gttttacttg ctttaaaaaa 60 

cctcccacac ctccccctga acctgaaaca taaaatgaat gcaattgttg ttgttaactt 120 

gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 180 

agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 240 

tgtctggcgg ccgcgacctg caggcgcaga actggtaggt atggaagatc cctcgagatc 300 

cattgtgctg gcggtaggcg agcagcgcct gcctgaagct gcgggcattc ccagtcagaa 360 

atgagcgcca gtcgtcgtcg gctctcggca ccgaagtgct atgattctcc gccagcatgg 420 

cttcggccag tgcgtcgagc agcgcccgct tgttcctgaa gtgccagtaa agcgccggct 480 

gctgaacccc caaccgttcc gccagtttgc gtgtcgtcag accgtctacg ccgacctcgt 540 
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LBP30AND31.ST25.txt 

tcaacaggtc cagggcggca cggatcactg tattcggctg caactttgtc atgcttgaca 600 

ctttatcact gataaacata atatgtccac caacttatca gtgataaaga atccgcgcca 660 

gcacaatgga tctcgaggtc gagggatctc tagaggatcc atattcgcgg gcatcaccgg 720 

cgccacaggt gcggttgctg gcgcctatat cgccgacatc accgatgggg aagatcgggc 780 

tcgccacttc gggctcatga gcgcttgttt cggcgtgggt atggtggcag gccccgirggc 840 

cgggggactg ttgggcgcca tctccttgca tgcaccattc cttgcggcgg cggtgctcaa 900 

cggcctcaac ctactactgg gctgcttcct aatgcaggag tcgcataagg gagagcgtcg 960 

acctcgggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1020 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1080 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1140 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1200 

cagttcggtg taggticgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 1260 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 1320 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 1380 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 1440 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 1500 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 1560 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 1620 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 1680 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 1740 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 1800 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 1860 

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 1920 

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 1980 

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2040 

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2100 

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2160 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 2220 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 2280 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 2340 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 2400 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 2460 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 2520 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 2580 
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gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 
gggttattgi: ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 
ggtitccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 
gacattaacc tataaaaata ggcgtatcac gaggccctga tggctctttg cggcacccat 
cgttcgtaat gttccgtggc accgaggaca accctcaaga gaaaatgtaa tcacactggc 
tcaccttcgg gtgggccttt ctgcgtttat aaggagacac tttatgttta agaaggttgg 
taaattcctt gcggctrtgg cagccaagct agatccggct gtggaatgtg tgtcagttag 
ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg catctcaatt 
agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca 
tgcatctcaa ttagtcagca accatagtcc cgcccctaac tccgcccatc ccgcccctaa 
ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt atttatgcag 
aggccgaggc cgcctcggcc tctgagctat tccagaagta gtgaggaggc ttttttggag 
gcctaggctt ttgcaaaaag ctagcftggg gccaccgctc agagcacctt ccaccatggc 
cacctcagca agttcccact tgaacaaaaa catcaagcaa atgtacttgt gcctgcccca 
gggtgagaaa gtccaagcca tgtatatctg ggttgatggt actggagaag gactgcgctg 
caaaacccgc accctggact gtgagcccaa gtgtgtagaa gagttacctg agtggaattt 
tgatggctct agtacctttc agtctgaggg ctccaacagt gacatgtatc tcagccctgt 
tgccatgttt cgggacccct tccgcagaga tcccaacaag ctggtgttct gtgaagtttt 
caagtacaac cggaagcctg cagagaccaa tttaaggcac tcgtgtaaac ggataatgga 
catggtgagc aaccagcacc cctggtttgg aatggaacag gagtatactc tgatgggaac 
agatgggcac ccttttggtt ggccttccaa tggctttcct gggccccaag gtccgtatta 
ctgtggtgtg ggcgcagaca aagcctatgg cagggatatc gtggaggctc actaccgcgc 
ctgcttgtat gctggggtca agattacagg aacaaatgct gaggtcatgc ctgcccagtg 
ggaactccaa ataggaccct gtgaaggaat ccgcatggga gatcatctct gggtggcccg 
tttcatcttg catcgagtat gtgaagactt tggggtaata gcaacctttg accccaagcc 
cattcctggg aactggaatg gtgcaggctg ccataccaac tttagcacca aggccatgcg 
ggaggagaat ggtctgaagc acatcgagga ggccatcgag aaactaagca agcggcaccg 
gtaccacatt cgagcctacg atcccaaggg gggcctggac aatgcccgtg gtctgactgg 
gttccacgaa acgtccaaca tcaacgactt ttctgctggt gtcgccaatc gcagtgccag 
catccgcatt ccccggactg tcggccagga gaagaaaggt tactttgaag accgcggccc 
ctctgccaat tgtgacccct ttgcagtgac agaagccatc gtccgcacat gccttctcaa 
tgagactggc gacgagccct tccaatacaa aaactaatta gactttgagt gatcttgagc 
ctttcctagt tcatcccacc ccgccccaga gagatctttg tgaaggaacc ttacttctgt 
ggtgtgacat aattggacaa actacctaca gagatttaaa gctctaaggt aaatataaaa 
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tttttaagtg tataatgtgt taaactactg attctaattg tttgtgtatt ttagattcca 
acctatggaa ctgatgaatg ggagcagtgg tggaatgcct ttaatgagga aaacctgttt 
tgctcagaag aaatgccatc tagtgatgat gaggctactg ctgactctca acattctact 
cctccaaaaa agaagagaaa ggtagaagac cccaaggact ttccttcaga attgctaagt 
tttttgagtc atgctgtgtt tagtaataga actcttgctt gctttgctat ttacaccaca 
aaggaaaaag ctgcactgct atacaagaaa attatggaaa aatattctgt aacctttata 
agtaggcata acagttataa tcataacata ctgttttttc ttactccaca caggcataga 
gtgtctgcta ttaataacta tgctcaaaaa ttgtgtacct ttagcttttt aatttgtaaa 
ggggttaata aggaatattt gatgtatagt gccttgacta gagatcataa tcagccatac 
cacatttgta gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa 
acataaaatg aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa 
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 
tggtttgtcc aaactcatca atgtatctta tcatgtctgg atctctagct tcgtgtcaag 
gacggtgagg cgcgcctact gagtcattag ggactttcca atgggttttg cccagtacat 
aaggtcaata ggggtgaatc aacaggaaag tcccattgga gccaagtaca ctgagtcaat 
agggactttc cattgggttt tgcccagtac aaaaggtcaa tagggggtga gtcaatgggt 
ttttcccatt attggcacgt acataaggtc aataggggtg agtcattggg tttttccagc 
caatttaatt aaaacgccat gtactttccc accattgacg tcaatgggct attgaaacta 
atgcaacgtg acctttaaac ggtactttcc catagctgat taatgggaaa gtaccgttct 
cgagccaata cacgtcaatg ggaagtgaaa gggcagccaa aacgtaacac cgccccggtt 
tticccctgga aattccatat tggcacgcat tctattggct gagctgcgtt ctacgtgggt 
ataagaggcg cgaccagcgt cggtaccgtc gcagtcttcg gtctgaccac cgtagaacgc 
agaagcttgc cgccaccatg gtgagcaagc agatcctgaa gaacaccggc ctgcaggaga 
tcatgagctt caaggtgaac ctggagggcg tggtgaacaa ccacgtgttc accatggagg 
gctgcggcaa gggcaacatc ctgttcggca accagctggt gcagatccgc gtgaccaagg 
gcgcccccct gcccttcgcc ttcgacatcc tgagccccgc cttccagtac ggcaaccgca 
ccttcaccaa gtaccccgag gacatcagcg acttcttcat ccagagcttc cccgccggct 
tcgtgtacga gcgcaccctg cgctacgagg acggcggcct ggtggagatc cgcagcgaca 
tcaacctgat cgaggagatg ttcgtgtacc gcgtggagta caagggccgc aacttcccca 
acgacggccc cgtgatgaag aagaccatca ccggcctgca gcccagcttc gaggtggtgt 
acatgaacga cggcgtgctg gtgggccagg tgatcctggt gtaccgcctg aacagcggca 
agttctacag ctgccacatg cgcaccctga tgaagagcaa gggcgtggtg aaggacttcc 
ccgagtacca cttcatccag caccgcctgg agaagaccta cgtggaggac ggcggcttcg 
tggagcagca cgagaccgcc atcgcccagc tgaccagcct gggcaagccc ctgggcagcc 
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tgcacgagtg ggtgtaata 

<210> 2 

<211> 8251 

<212> DNA 

<213> Hamster sp. 

<220> 

<221> raisc_feature 

<223> seq. ID. No. 3: circular plasmid GS vector pl2.4 hCMVp-GFP /clone 
13 

gaa«cattg atcataatca gccataccac atttgtagag gttttacttg ctttaaaaaa 
cctcccacac ctccccctga acctgaaaca taaaatgaat gcaattgttg ttgttaactt 
gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 
tgtctggcgg ccgcgacctg caggcgcaga actggtaggt atggaagatc cctcgagatc 
cattgtgctg gcggtaggcg agcagcgcct gcctgaagct gcgggcattc ccagtcagaa 
atgagcgcca gtcgtcgtcg gctctcggca ccgaagtgct atgattctcc gccagcatgg 
cttcggccag tgcgtcgagc agcgcccgct tgttcctgaa gtgccagtaa agcgccggct 
gctgaacccc caaccgttcc gccagtttgc gtgtcgtcag accgtctacg ccgacctcgt 
tcaacaggtc cagggcggca cggatcactg tattcggctg caactttgtc atgcttgaca 
ctttatcact gataaacata atatgtccac caacttatca gtgataaaga atccgcgcca 
gcacaatgga tctcgaggtc gagggatctc tagaggatcc atattegcgg gcatcaccgg 
cgccacaggt gcggttgctg gcgcctatat cgccgacatc accgatgggg aagatcgggc 
tcgccacttc gggctcatga gcgcttgttt cggcgtgggt atggtggcag gccccgtggc 
cgggggactg ttgggcgcca tctccttgca tgcaccattc cttgcggcgg cggtgctcaa 
cggcctcaac ctactactgg gctgcttcct aatgcaggag tcgcataagg gagagcgtcg 
acctcgggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 
tccccctgga agctccctcg tgcgctctcc tgttccgacG ctgccgctta ccggatacct 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 
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tacagagttc ttgaagtggt ggectaacta cggctacact agaagaacag tatttggtat 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 
ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 
gacattaacc tataaaaata ggcgtatcac gaggccctga tggctctttg cggcacccat 
cgttcgtaat gttccgtggc accgaggaca accctcaaga gaaaatgtaa tcacactggc 
tcaccttcgg gtgggccttt ctgcgtttat aaggagacac tttatgttta agaaggttgg 
taaattcctt gcggctttgg cagccaagct agatccggct gtggaatgtg tgtcagttag 
ggtgtggaaa gtccccaggc tccccagcag gcagaagtat gcaaagcatg catctcaatt 
agtcagcaac caggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca 
tgcatctcaa ttagtcagca accatagtcc cgcccctaac tccgcccatc ccgcccctaa 
ctccgcccag ttccgcccat tctccgcccc atggctgact aatttttttt atttatgcag 
aggccgaggc cgcctcggcc tctgagctat tccagaagta gtgaggaggc ttttttggag 
gcctaggctt ttgcaaaaag ctagcttggg gccaccgctc agagcacctt ccaccatggc 
cacctcagca agttcccact tgaacaaaaa catcaagcaa atgtacttgt gcctgcccca 
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gggtgagaaa gtccaagcca tgtatatctg ggtrtgatggt actggagaag gactgcgctg 3480 

caaaacccgc accctggact gtgagcccaa gtgtgtagaa gagttacctg agtggaattt 3540 

tgatggctct agtacctttc agtctgaggg ctccaacagt gacatgtatc tcagccctgt 3600 

tgccatgttt cgggacccct tccgcagaga tcccaacaag ctggtgttct gtgaagtttt 3660 

caagtacaac cggaagcctg cagagaccaa tttaaggcac tcgtgtaaac ggataatgga 3720 

catggtgagc aaccagcacc cctggtttgg aatggaacag gagtatactc tgatgggaac 3780 

agatgggcac ccttttggti: ggccttccaa tggctttcct gggccccaag gtccgtatta 3840 

ctgtggtgtg ggcgcagaca aagcctatgg cagggatatc gtggaggctc actaccgcgc 3900 

ctgcttgtat gctggggtca agattacagg aacaaatgct gaggtcatgc ctgcccagtg 3960 

ggaactccaa ataggaccct gtgaaggaat ccgcatggga gatcatctct gggtggcccg 4020 

tttcatcttg catcgagtat gtgaagactt tggggtaata gcaacctttg accccaagcc 4080 

cattcctggg aactggaatg gtgcaggctg ccataccaac tttagcacca aggccatgcg 4140 

ggaggagaat ggtctgaagc acatcgagga ggccatcgag aaactaagca agcggcaccg 4200 

gtaccacatt cgagcctacg atcccaaggg gggcctggac aatgcccgtg gtctgactgg 4260 

gttccacgaa acgtccaaca tcaacgactt ttctgctggt gtcgccaatc gcagtgccag 4320 

catccgcatt ccccggactg tcggccagga gaagaaaggt tactttgaag accgcggccc 4380 

ctctgccaat tgtgacccct ttgcagtgac agaagccatc gtccgcacat gccttctcaa 4440 

tgagactggc gacgagccct tccaatacaa aaactaatta gactttgagt gatcttgagc 4500 

ctttcctagt tcatcccacc ccgccccaga gagatctttg tgaaggaacc ttacttctgt 4560 

ggtgtgacat aattggacaa actacctaca gagatttaaa gctctaaggt aaatataaaa 4620 

tttttaagtg tataatgtgt taaactactg attctaattg tttgtgtatt ttagattcca 4680 

acctatggaa ctgatgaatg ggagcagtgg tggaatgcct ttaatgagga aaacctgttt 4740 

tgctcagaag aaatgccatc tagtgatgat gaggctactg ctgactctca acattctact 4800 

cctccaaaaa agaagagaaa ggtagaagac cccaaggact ttccttcaga attgctaagt 4860 

tttttgagtc atgctgtgtt tagtaataga actcttgctt gctttgctat ttacaccaca 4920 

aaggaaaaag ctgcactgct atacaagaaa attatggaaa aatattctgt aacctttata 4980 

agtaggcata acagttataa tcataacata ctgttttttc ttactccaca caggcataga 5040 

gtgtctgcta ttaataacta tgctcaaaaa ttgtgtacct ttagcttttt aatttgtaaa 5100 

ggggttaata aggaatattt gatgtatagt gccttgacta gagatcataa tcagccatac 5160 

cacatttgta gaggttttac ttgctttaaa aaacctccca cacctccccc tgaacctgaa 5220 

acataaaatg aatgcaattg ttgttgttaa cttgtttatt gcagcttata atggttacaa 5280 

ataaagc^at agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 5340 

tggtttgtcc aaactcatca atgtatctta tcatgtctgg atctagcttc gtgtcaagga 5400 

cggtgactgc agtgaataat aaaatgtgtg tttgtccgaa atacgcgttt tgagatttct 5460 
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gtcgccgact 


aaatztcatgt 


cgcgcgatag 


tggtgtttat 


cgccgataga 


gatggcgata 


5S20 


ttggaaaaat 


cgatatttga aaatatrggca 


tattgaaaat 


gtcgccgatg 


tgagtttctg 


5580 


tgtaactgat 


atcgccattt ttccaaaagt 


gatttttggg 


catacgcgat 


atctggcgat 


5640 


agcgcttata 


tcgtttacgg 


gggatggcga 


tagacgactt 


tggtgacttg 


ggcgattctg 


5700 


tgtgtcgcaa 


atatcgcagt 


ttcgatatag 


gtgacagacg 


atatgaggct 


atatcgccga 


5760 


tagaggcgac 


atcaagctgg 


cacatggcca 


atgcatatcg 


atctatacat 


tgaatcaata 


5820 


ttggccatta 


gccatattat 


tcattggtta 


tatagcataa 


atcaatattg 


gctattggcc 


5880 


attgcatacg 


trgtatccat 


atcataatat 


gtacatttat 


attggctcat 


gtccaacatt 


5940 


accgccatgt 


tgacattgat 


tatngactag 


ttattaatag 


taatcaatta 


cggggtcatt 


6000 


agttcatagc 


ccatatatgg 


agttccgcgt 


tacataactt 


acggtaaatg 


gcccgcctgg 


6060 


ctgaccgccc 


aacgaccccc 


gcccattgac 


gtcaataatg 


acgtatgttc 


ccatagtaac 


6120 


gccaataggg 


actttccatt 


gacgtcaatg 


ggtggagtat 


ttacggtaaa 


ctgcccactt 


6180 


ggcagtacat 


caagtgtatc 


atatgccaag 


tacgccccct 


attgacgtca 


atgacggtaa 


6240 


atggcccgcc 


tggcattatg 


cccagtacat 


gaccttatgg 


gactttccta 


cttggcagta 


6300 


catctacgta 


ttagtcatcg 


ctattaccat 


ggtgatgcgg 


ttttggcagt 


acatcaatgg 


6360 


gcgtggatag 


cggtutgact 


cacggggatt 


tccaagtctc 


caccccattg 


acgtcaatgg 


6420 


gagtttgttt 


tggcaccaaa atcaacggga 


ctttccaaaa 


tgtcgtaaca 


actccgcccc 


6480 


attgacgcaa 


atgggcggta ggcgtgtacg 


gtgggaggtc 


tatataagca 


gagctcgttt 


6540 


agtgaaccgt 


cagatcgcct ggagacgcca 


tccacgctgt 


tttgacctcc 


atagaagaca 


6600 


ccgggaccga 


tccagcctcc gcggccggga 


acggtgcatt 


ggaacgcgga 


ttccccgtgc 


6660 


caagagtgac 


gtaagtaccg 


cctatagagt 


ctataggccc 


acccccttgg 


cttcttatgc 


6720 


atgctatact 


gtttttggct tggggtctat 


acacccccgc 


ttcctcatgt 


tataggtgat 


6780 


ggtatagctt 


agcctatagg 


tgtgggttat 


tgaccattat 


tgaccactcc 


cctattggtg 


6840 


acgatacttt 


ccatnactaa 


tccataacat 


ggctctttgc 


cacaactctc 


tttattggct 


6900 


atatgccaat 


acactgtcct 


tcagagactg 


acacggactc 


tgtattttta 


caggatgggg 


6960 


tctcatttat 


tatttacaaa 


ttcacatata 


caacaccacc 


gtccccagtg 


cccgcagttt 


7020 


ttattaaaca 


taacgtggga 


tctccacgcg 


aatctcgggt 


acgtgttccg 


gacatgggct 


7080 


cttctccggt 


agcggcggag 


cttctacatc 


cgagccctgc 


tcccatgcct 


ccagcgactc 


7140 


atggtcgctc 


ggcagctcct 


tgctcctaac 


agtggaggcc 


agacttaggc 


acagcacgat 


7200 


gcccaccacc 


accagtgtgc 


cgcacaaggc 


cgtggcggta 


gggtatgtgt 


ctgaaaatga 


7260 


gctcggggag 


cgggcttgca 


ccgctgacgc 


atttggaaga 


cttaaggcag 


cggcagaaga 


7320 


agatgcaggc 


agcrgagttg 


ttgtgttctg 


ataagagtca 


gaggtaactc 


ccgttgcggt 


7380 


gctgttaacg 


gtggagggca 


gtgtagtctg 


agcagtactc 


gttgctgccg 


cgcgcgccac 


7440 


cagacataat 


agctgacaga 


ctaacagact 


gttcctttcc 


atgggtcttt 


tctgcagtca 


7500 
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ccgtccttga 


cacgaagctt gccgccacca 


tggtgagcaa gcagatcctg 


aagaacaccg 


7560 


a cct:aca.QQa 


gatcatgagc ttcaaggtga 


acctggaggg cgtggtgaac 


aaccacgtgt 


7620 


tcaccatgga 


gggctgcggc aagggcaaca 


tcctgttcgg caaccagctg 


gtgcagatcc 


7680 


acataaccaa 


gggcgccccc ctgcccttcg 


ccttcgacat cctgagcccc 


gccttccagt 


7740 


acggcaaccg 


caccttcacc aagtaccccg 


aggacatcag cgacttcttc 


atccagagct 


7800 


tccccgccgg 


cttcgtgtac gagcgcaccc 


tgcgctacga ggacggcggc 


ctggtggaga 


7860 


tccgcagcga 


catcaacctg atcgaggaga 


tgttcgtgta ccgcgtggag 


tacaagggcc 


7920 


gcaacttccc 


caacgacggc cccgtgatga 


agaagaccat caccggcctg 


cagcccagct 


7980 


tcgaggtggt 


gtacatgaac gacggcgtgc 


tggtgggcca ggtgatcctg 


gtgtaccgcc 


8040 


tgaacagcgg 


caagttctac agctgccaca 


tgcgcaccct gatgaagagc 


aagggcgtgg 


oXOO 


tgaaggactt 


ccccgagtac cacttcatcc 


agcaccgcct ggagaagacc 


tacgtggagg 




acggcggctt 


cgtggagcag cacgagaccg 


ccatcgccca gctgaccagc 


ctgggcaagc 




ccctgggcag 


cctgcacgag tgggtgtaat 


a 




8251 


<210> 3 










<211> 10369 








<212> DNA 










<213> Hamster sp. 









<220> 

<221> misc_feature 

<223> seq. ID. No. 2: circular plasmid GS-minigene vector p 14.4 DeltaM 
odulator Cmod-) hCMVp-GFP /clone 6 

gaattcaltg atcataatca gccataccac atttgtagag gttttacttg ctttaaaaaa 
cctcccacac ctccccctga acctgaaaca taaaatgaat gcaattgttg ttgttaactt 
gtttattgca gctcataatg gttacaaata aagcaatagc atcacaaatt tcacaaataa 
agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg tatcttatca 
tgtctggcgg ccgcgacctg caggcgcaga actggtaggt atggaagatc cctcgagatc 
cattgtgctg gcggtaggcg agcagcgcct gcctgaagct gcgggcattc ccagtcagaa 
atgagcgcca gtcgtcgtcg gctctcggca ccgaagtgct atgattctcc gccagcatgg 
cttcggccag tgcgtcgagc agcgcccgct tgttcctgaa gtgccagtaa agcgccggct 
gctgaacccc caaccgttcc gccagtttgc gtgtcgtcag accgtctacg ccgacctcgt 
tcaacaggtc cagggcggca cggatcactg tattcggctg caactttgtc atgcttgaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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ctttatcact gataaacata atatgtccac caacttatca gtgataaaga atccgcgcca 
gcacaatgga tctcgaggcc gagggatctc tagaggatcc atattcgcgg gcatcaccgg 
cgccacaggt gcggttgctg gcgcctatat cgccgacatc accgatgggg aagatcgggc 
tcgccacttc gggctcatga gcgcttgttt cggcgtgggt atggtggcag gccccgtggc 
cgggggactg ttgggcgcca tctccttgca tgcaccattc cttgcggcgg cggtgctcaa 
cggcctcaac ctactactgg gctgcttcct aatgcaggag tcgcataagg gagagcgtcg 
acctcgggcc gcgttgctgg cgtfcttcca taggctccgc ccccctgacg agcatcacaa 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 
tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat 
ctgcgctccg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 
ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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gggttattgt rtcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 2700 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 2760 

gacattaacc tataaaaata ggcgtatcac gaggccctga tggctctttg cggcacccat 2820 

cgttcgtaat gttccgtggc accgaggaca accctcaaga gaaaatgtaa tcacactggc 2880 

tcaccttcgg gtgggccttt ctgcgtttat aaggagacac tttatgttta agaaggttgg 2940 

taaattcctt gcggctttgg cagccaagct agatccagct ttttgcaaaa gcctaggcct 3000 

ccaaaaaagc ctcctrcacta cttctggaat agctcagagg ccgaggcggc ctcggcctct 3060 

gcataaataa aaaaaattag tcagccatgg ggcggagaat gggcggaact gggcggagtt 3120 

aggggcggga tgggcggagt taggggcggg actatggttg ctgactaatt gagatgcatg 3180 

ctttgcatac ttctgcctgc tggggagcct ggggactttc cacacctggt tgctgactaa 3240 

ttgagatgca tgctttgcat acttctgcct gctggggagc ctggggactt tccacaccci: 3300 

aactgacaca cattccacag ggaagctagc ttggaattaa ttccccgccc ccttccaata 3360 

caaaaactaa ttagactttg agtgatcttg agcctttcct agtttttgta ttggaagggc 3420 

tcgtcgccag tctcattgag aaggcatgtg cggacgatgg cttctgtcac tgcaaagggg 3480 

tcacaattgg cagaggggcg gcggtcttca aagtaacctt tcttctcctg ccgagccgag 3540 

aatgggagta gagccgactg cttgattccc acaccaatct cctcgccgct ctcacttcgc 3600 

ctcgttctcg tggctcgtgg ccctgtccac cccgtccatc atcccgccgg ccaccgctca 3660 

gagcaccttc caccatggcc acctcagcaa gttcccactt gaacaaaaac atcaagcaaa 3720 

tgtacttgtg cctgccccag ggtgagaaag tccaagccat gtatatctgg gttgatggta 3780 

ctggagaagg actgcgctgc aaaacccgca ccctggactg tgagcccaag tgtgtagaag 3840 

agttacctga gtggaatttt gatggctcta gtacctttca gtctgagggc tccaacagtg 3900 

acatgtatct cagccctgtt gccatgtttc gggacccctt ccgcagagat cccaacaagc 3960 

tggtgttctg tgaagttttc aagtacaacc ggaagcctgc agagaccaat ttaaggcact 4020 

cgtgtaaacg gataatggac atggtgagca accagcaccc ctggtttgga atggaacagg 4080 

agtatactct gatgggaaca gatgggcacc cttttggttg gccttccaat ggctttcctg 4140 

ggccccaagg tccgtattac tgtggtgtgg gcgcagacaa agcctatggc agggatatcg 4200 

tggaggctca ctaccgcgcc tgcttgtatg ctggggtcaa gattacagga acaaatgctg 4260 

aggtcatgcc tgcccagtgg gaactccaaa taggaccctg tgaaggaatc cgcatgggag 4320 

atcatctctg ggtggcccgt ttcatcttgc atcgagtatg tgaagacttt ggggtaatag 4380 

caacctttga ccccaagccc attcctggga actggaatgg tgcaggctgc cataccaact 4440 

ttagcaccaa ggccatgcgg gaggagaatg gtctgaagta agtagctccc tctggaccat 4500 

ctttattctc atggggtgga aggcctttgt gttagggttg ggaaaagttg gacttctcac 4560 

aaactacatg ccatgctctt cgtgtttgtc ataagcctat cgttttgtac ccgttggaga 4620 

agtgacagta ctctaggaat agaattacag ctgtgatatg ggaaagttgt cacgtaggtt 4680 
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caagcattta aaggtcttta gtaagaacta aatacacata caagcaagtg ggtgacttaa 
ttcttactga tgggaagagg ccagtgatgg gggtcttccc atccaaaaga taattggtat 
tacatgttga ggactggtct gaagcacttg agacataggt cacaaggcag acacagcctg 
catcaagtat ttattggttt cttatggaac tcatgcctgc tcctgccctt gaaggacagg 
tttctagtga caaggtcaga ccctcacctt tactgcttcc accaggcaca tcgaggaggc 
catcgagaaa ctaagcaagc ggcaccggta ccacattcga gcctacgatc ccaagggggg 
gctggacaat gcccgtggtc tgactgggtt ccacgaaacg tccaacatca acgacttttc 
tgctggtgtc gccaatcgca gtgccagcat ccgcattccc cggactgtcg gccaggagaa 
gaaaggttac tttgaagacc gccgcccctc tgccaattgt gacccctttg cagtgacaga 
agccatcgtc cgcacatgcc ttctcaatga gactggcgac gagcccttcc aatacaaaaa 
ctaattagac tttgagtgat cttgagcctt tcctagttca tgccaccccg ccccagctgt 
ctcattgtaa ctcaaaggat ggaatatcaa cggtcttttt attcctcgt:g cccagttaat 
ccttgctttt attggtcaga atagaggagt caagttctta atgcctatac accaacctca 
tttctttrcct atttagcttt ctacgtgggg gtgggagggg tagggagggg taggcgaagg 
gaacgtaacc acatgcttca tctcatcagg aatgccatgt ccagtaggca gagctgccac 
agagtgggtg tatttgtgga ggaggacttt trtcttcagga cagttaaaag agcaggtcca 
ctgcttggat tgacaattcc cctataggta gagagcttgc tagttcttca ggtaaaccaa 
ctttctattc caaatggaag ttaggtgagg agtagtggag gagttaatgc cctccatgaa 
gacagctcag tgtatcacct gagacagatg ggtagcccta ctgtaaaaga aggaaaagtt 
atttctgggt cctccattta taacacaaag cagtagtatt tttatattta aatgtaaaaa 
caaaagttat atatatgata tgtggatata tgtgtatttc taattcagaa accatcctag 
ttactgggtt tgccaagttt gaagagcttg gttaacaaga aaggatctct tgagtagagg 
tgggggtgca gtaccaggaa aggtggttat ctggggctca gcgctttatt actatgtggg 
gtttcccctg cccactctgc aggagcagat gctggacagg tagcagggtg ggacaccagt 
gcttgccacc acctgtccct gtgcttaggc taagatgcat atgtatccac acagagttag 
caggatggag ttggctggtc aacttgaaca ttgttactga taggggtggg tggggtttat 
tttttggtgg gactagcatg tcactaaagc aggccttttg atatattaaa ttttttaaag 
caaaacaagt tcagctttta atcaactttg tagggtttct aactttacag aattgcctgt 
ttgtttcagt gtctccatcc actttgctct tggaggaacg gaggacaggc agacctggag 
ttaaaacatt tgtcattttg tgtcatagtg tctactttct cccagcagaa tattcctttc 
cttcttagga gtcctatgga gttttgtttt tgtttttttt ctattacgat aaacataccc 
cacctccatt ctggcttgcc ctgctgttct ctggttgttt gtgtgctgtc cgcagcaggc 
tgcctgtggt tttctcttgc catgacgact tctaattgcc atgtacagta tgttcagtta 
gataactcct cattgtaaac agactgtaac tgccagagca gcgcttataa atcaacctaa 



4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 

6180 

6240 

6300 

6360 
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6480 
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6600 
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6720 
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catircataag atttcctcti: gacttgtttc tttgtggttrg ggggaggaag aaaaaaaaaa 6780 

gcgtgcagta tttttttgtt ccttcatttc ctatcaaaag aaaggggagt ggttctgttt 6840 

tgtttactcg caaaataagc tagcttatct attggctttt cttttttttt ttttttttaa 6900 

acgggctttt tcttgtacct ataatttggg gtaaggtgtg agagttttta tagttttttg 6960 

agacagggtc ttggtgtata cccttggctg gcctggagct aactatgtag actgggctag 7020 

cctttaactt gcagttctgc tttcaattag ggtttataca tttagtcttg gcaattccta 7080 

gttccacgtt taatctcttt acatttcaaa gcagtgttat ctgaagagtt caggcgcaga 7140 

gtcaattcaa tagagttaca caaaaaccta aaaaacaagt tttaaatacc aagttatgtt 7200 

ggcctggcca cttttcacag ctgtccacaa ctcaatgtga caaggctaca aattggatat 7260 

actagaattt cctggtgatt tggaacccct gcttcatttc ccggaaccag ggcttttggt 7320 

gacagtccta gcttatcaga ttatttaaaa cagttactct tcctgcccct cttcctgaga 7380 

cctttgtcca gctgccatga gccatctaca cagtacttgc ttccctgttg aagtcactga 7440 

aggcacatca gcccaagaca taaaggcttg tcccggattc actagcctgg tgaacttgtg 7500 

gttctctgat gttttgtcct gttttgttgt gatttagtct caaatttccc agcctggttt 7560 

gaaaatctgg gctcccagcc ttcaataagg aggactacag atatgtacga ctgagccttg 7620 

attccagcct catgtttata cgtctgtgct cagctccctg aaggttccag tttgaaactc 7680 

aataatccag gggtcagaaa gtcttgatct tatccccaca gtatggcacc aagcctggct 7740 

gagccttctg acttagtctg ccctgttgct atttaagcac ttttcttcac taggctaaaa 7800 

ataaaaggag cttcctcctt tgccatggcg ctgtgcatga taggaaaagg tagctatcta 7860 

ctagcatatt aactccactg tttirtgcttt gtgtgtttgg tttttgagga agggtctcaa 7920 

ctgtgtatcc ctggctggcc tggccggatc tagcttcgtg tcaaggacgg tgaggcgcgc 7980 

caatattggc tattggccat tgcatacgtt gtatccatat cataatatgt acatttatat 8040 

tggctcatgt ccaacattac cgccatgttg acattgatta ttgactagtt attaatagta 8100 

atcaattacg gggtcattag ttcatagccc atatatggag ttccgcgtta cataacttac 8160 

ggtaaatggc ccgcctggct gaccgcccaa cgacccccgc ccattgacgt caataatgac 8220 

gtatgttccc atagtaacgc caatagggac tttccattga cgtcaatggg tggagtattt 8280 

acggtaaact gcccacttgg cagtacatca agtgtatcat atgccaagta cgccccctat 8340 

tgacgtcaat gacggtaaat ggcccgcctg gcattatgcc cagtacatga ccttatggga 8400 

ctttcctact tggcagtaca tctacgtatt agtcatcgct attaccatgg tgatgcggtt 8460 

ttggcagtac atcaatgggc gtggatagcg gtttgactca cggggatttc caagtctcca 8520 

ccccattgac gtcaatggga gtttgttttg gcaccaaaat caacgggact ttccaaaatg 8580 

tcgtaacaac tccgccccat tgacgcaaat gggcggtagg cgtgtacggt gggaggtcta 8640 

tataagcaga gctcgtttag tgaaccgtca gatcgcctgg agacgccatc cacgctgttt 8700 

tgacctccat agaagacacc gggaccgatc cagcctccgc ggccgggaac ggtgcattgg 8760 



c 'V- * 

WO 2004/009823 



PCT/EP2003/007946 



14/21 



aacgcggatt 
ccccttggct 
cctcatgtta 
accactcccc 
caactctctt 
tatttttaca 
ccccagtgcc 
gtgttccgga 
ccatgcctcc 
acttaggcac 
gtatgtgtct 
taaggcagcg 
ggtaactccc 
tgctgccgcg 
gggtctlittc 
agatcctgaa 
tggtgaacaa 
accagctggt 
tgagccccgc 
acttcttcat 
acggcggcct 
gcgtggagta 
ccggcctzgca 
tgatcctggt 
tgaagagcaa 
agaagaccta 
tgaccagcct 



ccccgngcca 
tcttatgcat 
taggtgatgg 
tattggtgac 
tattggctat 
ggatggggtc 
cgcagttttt 
catgggctct 
agcgacucat 
agcacgatgc 
gaaaatgagc 
gcagaagaag 
gttgcggtgc 
cgcgccacca 
tgcagtcacc 
gaacaccggc 
ccacgtgttc 
gcagatccgc 
cttccagtac 
ccagagcttc 
ggtggagatc 
caagggccgc 
gcccagcttc 
gtaccgcctg 
gggcgtggtg 
cgtggaggac 
gggcaagccc 



LB P 3 OAND 3 1 . ST2 5 . t Xt 

agagtgacgt aagtaccgcc tatagagtct 



gctatactgt ttttggcttg gggtctatac 
tatagcttag cctataggtg tgggttattg 
gatactttcc attactaatc cataacatgg 
atgccaatac actgtccttc agagactgac 
tcatttatta tttacaaatt cacatataca 
attaaacata acgtgggatc tccacgcgaa 
tctccggtag cggcggagct tctacatccg 
ggtcgctcgg cagctccttg ctcctaacag 
ccaccaccac cagtgtgccg cacaaggccg 
tcggggagcg ggcttgcacc gctgacgcat 
atgcaggcag ctgagttgtt gtgttctgat 
tgttaacggt ggagggcagt gnagtctgag 
gacataatag ctgacagact aacagactgt 
gtccttgaca cgaagcttgc cgccaccatg 
ctgcaggaga tcatgagctt caaggtgaac 
accatggagg gctgcggcaa gggcaacatc 
gtgaccaagg gcgcccccct gcccttcgcc 
ggcaaccgca ccttcaccaa gtaccccgag 
cccgccggct tcgtgtacga gcgcaccctg 
cgcagcgaca tcaacctgat cgaggagatg 
aacttcccca acgacggccc cgtgatgaag 
gaggtggtgt acatgaacga cggcgtgctg 
aacagcggca agttctacag ctgccacatg 
aaggacttcc ccgagtacca cttcatccag 
ggcggcttcg tggagcagca cgagaccgcc 
ctgggcagcc tgcacgagtg ggtgtaata 



ataggcccac 

acccccgctt 

accattattg 

ctctttgcca 

acggactctg 

acaccaccgt 

tctcgggtac 

agccctgctc 

tggaggccag 

tggcggtagg 

ttggaagact 

aagagtcaga 

cagtactcgt 

tcctttccat 

gtgagcaagc 

ctggagggcg 

ctgttcggca 

ttcgacatcc 

gacatcagcg 

cgctacgagg 

ttcgtgtacc 

aagaccatca 

gtgggccagg 

cgcaccctga 

caccgcctgg 

atcgcccagc 



8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10369 



<210> 4 

<211> 13535 

<212> DMA 

<213> Hamster sp. 



<220> 
<221> 



misc^feature 
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<223> seq. ID. No. 1: circular plasmid GS + igG 2A hot spot targetting 
vector pEE 15-1 hCMVp-GFP /clone 11 



<400> 4 
gaattcattg 

cctcccacac 

gtttattgca 

agcatttttt 

tgtctggcgg 

cattgtgctg 

atgagcgcca 

cttcggccag 

gctgaacccc 

tcaacaggtc 

ctttatcact 

gcacaatgga 

atcaccggcg 

gatcgggctc 

ccgtggccgg 

tgctcaacgg 

agcgtcgagt 

tgtcctgcag 

gcccagccag 

gaaaattggt 

gcctagctag 

agaaggagat 

aaggtcatga 

gccataccca 

cccagccttc 

tgcccaggta 

ccctgcacta 

acctcttggg 

tctccctgag 

tccagatcag 

gagaggatta 



atcataatca 
ctccccctga 
gcttataatg 
tcactgcatt 
ccgcgacctg 
gcggtaggcg 
gtcgtcgtcg 
tgcgtcgagc 
caaccgttcc 
tagggcggca 
gataaacata 
tctcgaggtc 
ccacaggtgc 
gccacttcgg 
gggactgttg 
cctcaaccta 
cctccgtgtt 
tctgacctct 
tccatcacct 
gaggaaaaca 
accagccagg 
actgactgtg 
tcctctagga 
gggacaaagt 
tctctgcaga 
agtcactaga 
gaggataagc 
tggaccatcc 
ccccatagtc 
ctggtttgtg 
caacagtact 



gccataccac 

acctgaaaca 

gttacaaata 

ctagttgtgg 

caggcgcaga 

agcagcgcct 

gctctcggca 

agcgcccgct 

gccagtttgc 

cggatcactg 

atatgtccac 

gagggatctc 

ggttgctggc 

gctcatgagc 

ggcgccatct 

ctactgggct 

cgaagcgatc 

acaccctcag 

gcaatgtggc 

aggggagtag 

atcagcagcc 

actccctctt 

tagatgtcct 

ccctggtttg 

gcccagaggg 

ccagagctcc 

catgtacaga 

gtcttcatct 

acatgtgtgg 

aacaacgtgg 

ctccgggtgg 



atttgtagag 

taaaatgaat 

aagcaatagc 

tttgtccaaa 

actggtaggt 

gcctgaagct 

ccgaagtgct 

tgttcctgaa 

gtgtcgtcag 

tattcggctg 

caacttatca 

tagaggatcc 

gcctatatcg 

gcttgtttcg 

ccttgcatgc 

gcttcctaat 

cctgtccagt 

cagctcagtg 

ccacccggca 

aggttcacaa 

atcaccaaaa 

ggaaacttct 

tggtcatttc 

gtgccttttc 

cccacaatca 

acccgggaga 

tccatttcca 

tccctccaaa 

tggtggatgt 

aagtacacac 

tcagtgccct 



gttttacttg 

gcaattgttg 

atcacaaatt 

ctcatcaatg 

atggaagatc 

gcgggcattc 

atgattctcc 

gtgccagtaa 

accgtctacg 

caactttgtc 

gtgataaaga 

atattcgcga 

ccgacatcac 

gcgtgggtat 

accattcctt 

gcaggagtcg 

ggtgtgcaca 

actgtaacct 

agcagcacca 

gtgattagtc 

atgggaactt 

aactatgacc 

caggatcatc 

tccttcaaac 

agccctgtcc 

atggtaagtg 

tctctcctca 

gatcaaggat 

gagcgaggat 

agctcagaca 

ccccatccag 



ctttaaaaaa 
ttgttaactt 
tcacaaataa 
tatcttatca 
cctcgagatc 
ccagtcagaa 
gccagcatgg 
agcgccggct 
ccgacctcgt 
atgcttgaca 
atccgcgcca 
atatgccggc 
cgatggggaa 
ggtggcaggc 
gcggcggcgg 
cataagggag 
ccttcccagc 
cgagcacctg 
aggtggacaa 
taaggcctta 
ggcccagaag 
acctaccttc 
ctgacctaag 
ttgagtaacc 
tccatgcaaa 
ctgtaaacat 
tcagcaccta 
gtactcatga 
gacccagatg 
caaacccata 
caccaggact 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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ggalrgagtgg caaggagttc aaatgcaagg tcaacaacaa agacctccca gcgcccatcg 1920 

agagaaccat ctcaaaaccc aaaggtgaga gctgcagcct gactgcatgg gggctgggat 1980 

gggcataagg ataaaggtct gtgtggacag ccttctgctt cagccatgac ctttgtgtat 2040 

gtttctaccc tcacagggtc agtaagagct ccacaggtat atgtcttgcc tccaccagaa 2100 

gaagagatga ctaagaaaca ggtcactctg acctgcatgg tcacagactt catgcctgaa 2160 

gacatttacg tggagtggac caacaacggg aaaacagagc taaactacaa gaacactgaa 2220 

ccagtcctgg actctgatgg ttcttacttc atgtacagca agctgagagt ggaaaagaag 2280 

aactgggtgg aaagaaatag ctactcctgt tcagtggtcc acgagggtct gcacaatcac 2340 

cacacgacta agagcttctc ccggactccg ggtraaatgag ctcagcaccc acaaaactct 2400 

caggtccaaa gagacaccca cactcatctc catgcttccc ttgtataaat aaagcaccca 2460 

ccaatgcctg ggaccatgta aaactgtcct ggttctttcc aaggtataga gcatagctca 2520 

caggctgata tttctggcca gggttggagg acagccttgt ctataggaag agaatgaggt 2580 

ttttgcactg caggactcag agctcattag utatcctgcc ttggagtgtt: ggggcttggc 2640 

tttaggcagt gccttrttcct tgccttccta cgaaccagca gctgccatac atagagataa 2700 

tcctaggaag cctcaaatgg agaaggacac aaacccacct ccctcaggct gttcctctat 2760 

cccggcccca cttctttacc taggggtttc tctgagtcta ttgtggagtt acacatggcc 2820 

aggggcattc cagagaccct tgtcatccat acactcaact caggcagctt: tgcacaaaca 2880 

aagtctgcac acccatacag atggctcact cttgcctgtg ccatgtaggg ctgaggcaca 2940 

tggctcttgc tgccccaagg gagggactat nagatagcca cactcatgct gaatcctggc 3000 

ccattcaaat: tagcctgctg aacaccatcc agtccatata gcacatgtat ccacatgcac 3060 

gtgtgcacaa aacgcattta atacactggg acaacaattc tgtgccctgc acagcaccta 3120 

tatccagcaa tgtatcacca tacacacgac caaaaaaatt caatgcccac gtttctgcca 3180 

tcacaaacag acacatcttt cctctctgtg gccactgcat tatatgctca acacaagacc 3240 

tctgaagcca gatccatctc trggcacctcg gggtcatgct tcaaccccac atgaattatg 3300 

caaaccatag ccataatggt ctgaatcact tcacactggg atgttcccaa gttcaggcaa 3360 

gacgagccac aggctctgct gatgactgaa ggacagcaaa gggtcagtcc agctgtatag 3420 

ccactgttga cctgggtcac aggccctgct gaccctccac cttctcctgt actgaaggaa 3480 

tgaaagatga gacaagcata gagggcactt gaataatcca ggtcactctg aggtccaccc 3540 

aaggcattat tggactcagg tgggaagctg agactggtgt cccagaggga aaggaaggaa 3600 

agcaggcccc ggggagggtc tgctgtccca gtcaggctgg agatctctcc tctgaatcca 3660 

tgcagacatg tctgcctcac agggaatctc tcccagcacc aaccatgttg ggacaaacac 3720 

tgactgtcct ctctgttcag ggctagacct ggatgatgtc tgtgctgagg cccaggacgg 3780 

ggagctggac ggcctctgga cgaccatcac catcttcatc agcctcttcc tgctcagcgt 3840 

gtgctacagc gcctctgtca cactcttcaa ggttggcact gtctcccacc ctctgctgtg 3900 
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atggctacac tgaccacaaa atgtcctctc actcctcccc agatgtagta ggacgttact 
ttgctgcccc tactctgtcc cacacaccat ttcctccatt ccctgagcca tcccacattg 
ttctatgtga ctccacattg tgtcccatac agtctgccct tctgtctctc trggctgtcct 
gcgtgatcct gatactgtct tatgagacca aacctccttg cattccacac tagccttcat 
gaggttcaat gctgtcttac acacaatccc ctcagcctca ccatggctca aggtactctg 
tgagctatcc tcataccatc tccacctcaa ctcccacaat atctccactc tgacccctcc 
catacccagt ctcctacctg tatgaaggga attgaaggag agacaggtcg acctctgtct 
ttcccacaga ttggagggtc tgagcatggg cgtggtctct gactttctct cacttcccca 
caggtaaagt ggatcttctc ctctgtggtg gagctgaagc agacgatctc ccctgactac 
agaaacatga ttgggcaggg agcctaggcc acttcctctg ggatcagaag agcttcctag 
gccctgcaga agcccatcca tcctactgtg cagcctaaca gggaggccac actctagccc 
tatgactctc tgatcagaac tcccatggtc tcctctttgg aggaccacgt gcagtgcagg 
ctttgcccag acctaaacac ttccacagca gtcgccagat atctaactac tccggaccag 
aagaaccatc tccttccaaa ccagcactag ggatctgaga tctcagaatg tttgcctaag 
aagagctgga aatccaggct tcctgtgttc tgctacaagg acatcagcct ggatttgacc 
tggaccacac attttcatct aaatgagttt tccacaaagg acacgtttca gatccttgaa 
tgagacctc-c acatggaaga ccagagtcac tatacccaaa ggtcactctg tatccttgca 
ccagctatac tggacagctt ccttcctggt acttcagtga ccctggctga ggaaaggatc 
tgtgacctca actgtttgga gagcctctgg aagatgtagt cttctcttcc tgctaccacc 
aacatgctgg atctcagatg cagaatrccaa tccacagaca ccactgacca cacaacctga 
agacaaggcc attgccacct ccacagagat gccatccaca ctctgtggag aaataaggag 
tgctttgtgc agcctctgca aagctctggc agggattaga gtatacacac tgagtactga 
ctaggtgacc aggcagaaaa acctccagga gaaggaacaa tgggggagag atgtgaacag 
atagttagaa aaagcatggt gtcacaggtc tgctctgtgg actgatttcc agattggacc 
acctacagca gaaaccatcg gttgcagtgg caatctagga ggaccaacct ggaataggag 
ggctgctgtg gtcaatggag agtagacctg tatctatttc tccactgcct cttatgacca 
ataagaagcc agagtctcca gacagaaaga aagaaagaaa gaaagaaaga aagaaagaaa 
gaaagagaga gagagagaga gagagagaga gaggaaggaa ggaaggaagg aaggaaggaa 
ggaaggaagg aggaggagga ggaggaggag gaggaggaga gagagagaga gagagagaga 
gagagagaga gagcaccagc titttctgtga ctggaaggaa atgcttagag agcttggatc 
tttaaagctt cttttttcta gagaccatga atgtctttgt tctctctctc tctctctctc 
tctctctctc tctctctctc tctctctgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgcg 
tgcatgcacg ctattgtttt ggcatttgaa acaataaaac attcttttaa tattctgtat 
ctcatggttc cccttctgtg tggatcagcc ctaacaccca ggaacagggg acaataaaca 



3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
-5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
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gaccacagcc atgtacagcc ttctacctcc cttctggttc tgacctccca gaggtccctc 
agtgggcccc tcacagctgg gtttcttccc tggcagtgcc accaagagct caggcacctc 
tgagctggag gctgtcctga tgccataggc aggctatgga gcagagatga tgaccacggt 
ggactccagg tgagccaggc aaagcctccc atgccagaag agaagcgtgt ggtactcact 
ggcctcgggc tgctacggat tcagcaaaga gcatggatcg cttcgaagcc tccaagctcg 
acctcgggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 
aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 
tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 
gtccgccttt ctcccttcgg gaagcgtggc gctttctcaa tgctcacgct gtaggtatct 
cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 
cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 
atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 
tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 
ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 
acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 
aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 
aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 
tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 
cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 
catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 
ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 
aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 
ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 
caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttic 
attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 
agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 
actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 
ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 
ttgctcttgc ccggcgtcaa cacgggataa taccgcgcca catagcagaa ctttaaaagt 
gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 
atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 
cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 
gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 



6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
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6600 
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ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 
gacattaacc tataaaaata ggcgtatcac gaggccctga tggctctttg cggcacccat 

cgttcgtaat gttccgtggc accgaggaca accctcaaga gaaaatgtaa tcacactggc 8160 

tcaccttcgg gtgggccttt ctgcgtttat aaggagacac tttatgttta agaaggttgg 8220 

taaattcctt: gcggctttgg cagccaagct agagatccgg ctgtggaatg tgtgtcagtt 8280 

agggtgtgga aagtccccag gctccccagc aggcagaagt atgcaaagca tgcatctcaa 8340 

ttagtcagca accaggtgtg gaaagtcccc aggctcccca gcaggcagaa gtatgcaaag 8400 

catgcatctc aattagtcag caaccatagt cccgccccta actccgccca tcccgcccct 8460 

aactccgccc agttccgccc attctccgcc ccatggctga ctaatttttt ttatttatgc 8520 

agaggccgag gccgcctcgg cctctgagct attccagaag tagtgaggag gcttttttgg 8580 

aggcctaggc ttttgcaaaa agctagcttg gggccaccgc tcagagcacc ttccaccatg 8640 

gccacctcag caagttccca cttgaacaaa aacatcaagc aaatgtactt gtgcctgccc 8700 

cagggtgaga aagtccaagc catgtatatc tgggttgatg gtactggaga aggactgcgc 8760 

tgcaaaaccc gcaccctgga ctgtgagccc aagtgtgtag aagagttacc tgagtggaat 8820 

tttgatggct ctagtacctt tcagtctgag ggctccaaca gtgacatgta tctcagccct 8880 

gttgccatgt ttcgggaccc cttccgcaga gatcccaaca agctggtgtt ctgtgaagtt 8940 

ttcaagtaca accggaagcc tgcagagacc aatttaaggc actcgtgtaa acggataatg 9000 

gacatggtga gcaaccagca cccctggttt ggaatggaac aggagtatac tctgatggga 9060 

acagatgggc acccttttgg ttggccttcc aatggcttcc ctgggcccca aggtccgtat 9120 

tactgtggtg tgggcgcaga caaagcctat ggcagggata tcgtggaggc tcactaccgc 9180 

gcctgcttgt atgctggggt caagattaca ggaacaaatg ctgaggtcat gcctgcccag 9240 

tgggaactcc aaataggacc ctgtgaagga atccgcatgg gagatcatct ctgggtggcc 9300 

cgtttcatct tgcatcgagt atgtgaagac tttggggtaa tagcaacctt tgaccccaag 9360 

cccattcctg ggaactggaa tggtgcaggc tgccatacca actttagcac caaggccatg 9420 

cgggaggaga atggtctgaa gcacatcgag gaggccatcg agaaactaag caagcggcac 9480 

cggtaccaca ttcgagccta cgatcccaag gggggcctgg acaatgcccg tggtctgact 9540 

gggttccacg aaacgtccaa catcaacgac ttttctgctg gtgtcgccaa tcgcagtgcc 9600 

agcatccgca ttccccggac tgtcggccag gagaagaaag gttactttga agaccgcggc 9660 

ccctctgcca attgtgaccc ctttgcagtg acagaagcca tcgtccgcac atgccttctc 9720 

aatgagactg gcgacgagcc cttccaatac aaaaactaat tagactttga gtgatcttga 9780 
gcctttccta gttcatccca ccccgcccca gagagatctt tgtgaaggaa ccttacttct 
gtggtgtgac ataattggac aaactaccta cagagattta aagctctaag gtaaatataa 
aattttrtaag tgtataatgt gttaaactac tgattctaat tgtttgtgta ttttagattc 

caacctatgg aactgatgaa tgggagcagt ggtggaatgc ctttaatgag gaaaacctgt 10020 
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tttgctcaga 
ctcctccaaa 
gttttttgag 
caaaggaaaa 
taagtaggca 
gagtgtctgc 
aaggggttaa 
accacattrtg 
aaacataaaa 
aaataaagca 
tgtggtutgt 
aggacggtga 
ttctgtcgcc 
gatattggaa 
tctrgtgtaac 
cgatagcgct 
tctgtgtgtc 
ccgatagagg 
aatattggcc 
ggccattgca 
cattaccgcc 
cattagtuca 
ctggctigacc 
taacgccaat 
acttggcagt 
gtaaatggcc 
agtacatcta 
atgggcgtgg 
atgggagttt 
ccccattgac 
gtttagtgaa 
gacaccggga 
gtgccaagag 
atgcatgcta 



agaaatgcca 
aaagaagaga 
tcatgctgtg 
agctgcactg 
taacagttat 
tattaataac 
taaggaatat 
tagaggtttt 
tgaatgcaat 
atagcatcac 
ccaaactcat 
ctgcagtgaa 
gactaaattc 
aaatcgatat 
tgatatcgcc 
tatatcgttt 
gcaaatancg 
cgacatcaag 
attagccata 
tacgttgtat 
atgttgacat 
tagcccatat 
gcccaacgac 
agggactttc 
acatcaagtg 
cgcctggcat 
cgtattagtc 
atagcggttt 
gtttitggcac 
gcaaatgggc 
ccgtcagatc 
ccgatccagc 
tgacgtaagt 
tactgttttt 
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tctagtgatg atgaggctac tgctgactct 
aaggtagaag accccaagga ctttccttca 
tttagtaata gaactcttgc ttgctttgct 
ctatacaaga aaattatgga aaaatattct 
aatcataaca tactgttttt tcttactcca 
tatgctcaaa aattgtgtac ctttagcttt 
ttgatgtata gtgccttgac tagagatcat 
acttgcttta aaaaacctcc cacacctccc 
tgttgttgtt aacttgttta ttgcagctta 
aaatttcaca aataaagcat ttttttcact 
caatgtatct tatcatgtct ggatctctag 
taataaaatg tgtgtttgtc cgaaatacgc 
atgtcgcgcg atagtggtgt ttatcgccga 
ttgaaaatat ggcatattga aaatgtcgcc 
atttttccaa aagtgatttt tgggcatacg 
acgggggatg gcgatagacg actttggtga 
cagtttcgat ataggtgaca gacgatatga 
ctggcacatg gccaatgcat atcgatctat 
ttattcattg gttatatagc ataaatcaat 
ccatatcata atatgtacat ttatattggc 
tgattattga ctagttatta atagtaatca 
atggagttcc gcgttacata acttacggta 
ccccgcccat tgacgtcaat aatgacgtat 
cattgacgtc aatgggtgga gtatttacgg 
tatcatatgc caagtacgcc ccctattgac 
tatgcccagt acatgacctt atgggacttt 
atcgctatta ccatggtgat gcggttttgg 
gactcacggg gatttccaag tctccacccc 
caaaatcaac gggactttcc aaaatgtcgt 
ggtaggcgtg tacggtggga ggtctatata 
gcctggagac gccatccacg ctgttttgac 
ctccgcggcc gggaacggtg cattggaacg 
accgcctata gagtctatag gcccaccccc 
ggcttggggt ctatacaccc ccgcttcctc 



caacattcta 
gaattgctaa 
atttacacca 
gtaaccttta 
cacaggcata 
ttaatttgta 
aatcagccat 
cctgaacctg 
taatggttac 
gcattctagt 
cttcgtgtca 
gttttgagat 
tagagatggc 
gatgtgagtt 
cgatatctgg 
cttgggcgat 
ggctatatcg 
acattgaatc 
attggctatt 
tcatgtccaa 
attacggggt 
aatggcccgc 
gttcccatag 
taaactgccc 
gtcaatgacg 
cctacttggc 
cagtacatca 
attgacgtca 
aacaactccg 
agcagagctc 
ctccatagaa 
cggattcccc 
ttggcttctt 
atgttatagg 
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tgatggtana gcttagccta taggtgtggg ttattgacca ttattgacca rtcccctatt 
ggtgacgata ctttccatta ctaatccata acatggctct ttgccacaac tctctrttatt 
ggctatatgc caatacactg tccttcagag actgacacgg actctgtatt tttacaggat 
ggggtctcat ttattattta caaattcaca tatacaacac caccgtcccc agtgcccgca 12300 
gtttttatta aacataacgt gggatctcca cgcgaatctc gggtacgtgt tccggacatg 12360 
ggetcttctc cggtagcggc ggagcttcta catccgagcc ctgctcccat gcctccagcg 
actcatggtc gctcggcagc tccttgctcc taacagtgga ggccagactt aggcacagca 
cgatgcccac caccaccagt gtgccgcaca aggccgtggc ggtagggtat gtgtctgaaa 
atgagctcgg ggagcgggct tgcaccgctg acgcatttgg aagacttaag gcagcggcag 
aagaagatgc aggcagctga gttgttgtgt tctgataaga gtcagaggta actcccgttg 
cggtgctgtt aacggtggag ggcagtgtag tctgagcagt actcgttgct gccgcgcgcg 
ccaccagaca taatagctga cagactaaca gactgttcct ttccatgggt cttttctgca 
gtcaccgtcc ttgacacgaa gcttgccgcc accatggtga gcaagcagat cctgaagaac 
accggcctgc aggagatcat gagcttcaag gtgaacctgg agggcgtggt gaacaaccac 
gtgttcacca tggagggctg cggcaagggc aacatcctgt tcggcaacca gctggtgcag 
atccgcgtga ccaagggcgc ccccctgccc ttcgccttcg acatcctgag ccccgccttc 
cagtacggca accgcacctt caccaagtac cccgaggaca tcagcgactt cttcatccag 
agcttccccg ccggcttcgt gtacgagcgc accctgcgct acgaggacgg cggcctggtg 
gagatccgca gcgacatcaa cctgatcgag gagatgttcg tgtaccgcgt ggagtacaag 
ggccgcaact tccccaacga cggccccgtg atgaagaaga ccatcaccgg cctgcagccc 
agcttcgagg tggtgtacat gaacgacggc gtgctggtgg gccaggtgat cctggtgtac 
cgcctgaaca gcggcaagtt ctacagctgc cacatgcgca ccctgatgaa gagcaagggc 
gtggtgaagg acttccccga gtaccacttc atccagcacc gcctggagaa gacctacgtg 
gaggacggcg gcttcgtgga gcagcacgag accgccatcg cccagctrgac cagcctgggc 
aagcccctgg gcagcctgca cgagtgggtg taata 
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12600 
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